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(57) Abstract: The invention relates to nucleic acid molecules, 
proteins encoded by such nucleic acid molecules; and use of the 
proteins and nucleic acid molecule. 
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TITLE : Novel Kallikrein Gene 
FIELD OF THE INVENTION 
5 The invention relates to nucleic acid molecules, proteins encoded by such nucieic acid molecules; and 

use of the proteins and nucleic acid molecules. 
BACKGROUND OF THE INVENTION 

Kallikreins are a group of serine proteases that are found in diverse tissues and biological fluids. The 
term "kallikrein" was first introduced by Werle and colleagues who found high levels of their original isolates 

10 in the pancreas (in Greek, the "kallikreas") (1,2). BCallikreins are divided into two main groups; the plasma 
kallikrein, which is a single gene (3), and the tissue kallikreins, which are encoded by a large multi-gene 
family in rodents (4,5). Until recently, the human kallikrein gene family was thought to consist of only three 
members (6). However, 11 new members of the kallikrein gene family have been identified (7-18). The 
progress in this area of investigation has recently been reviewed (7). 

1 5 Prostate specific antigen (PSA), currently the most useful tumor marker for prostate cancer diagnosis 

and monitoring, is a member of the human kallikrein gene family of serine proteases (19,20). In addition to 
PSA, human glandular kallikrein 2 (hK2, encoded by the KLK2 gene) has been proposed as an adjuvant 
diagnostic marker for prostate cancer (21,22). Moreover, accumulating evidence indicates that other members 
of the expanded kallikrein gene family may be associated with malignancy (7). The normal epithehal cell- 

2 0 specific 1 gene (NESl) (KLKl 0, according to the approved human tissue kallikrein gene nomenclature) was 

found to be a novel tumor suppressor, which is down-regulated dming breast cancer progression (23). Other 
gene ^ily members, including zyme (KLK6), neuropsin (KLK8), and himian stratum comeum chymotyrptic 
enzyme (HSCCE; KLK7) were also found to be differentially expressed in certain types of malignancies (24- 
26). 

25 SUMMARY OF THE INVENTION 

The present inventors identified a nucleic acid molecule encoding a novel kallikrein. The nucleic acid 
molecule maps to chromosome 19ql3.3-ql3.4 and is located between the klkl and IdkS genes. The novel 
nucleic acid molecule designated "klklS" has three alternatively spliced forms and is primarily expressed in 
the thyroid gland, and to a lower extent in the prostate, salivary and adrenal glands, colon, testis, and kidney. 

3 0 The expression of the nucleic acid is up-regulatedin prostate cancer and it is under steroid hormone regulation 

in the LNCaP prostate cancer cell line. Higher expression of klkl 5 is associated with more aggressive (higher 
stage and higher grade) prostate tumors. 

The novel kallikrein protein described herein is refened to as "Kallikrein 1 5", "KLKl 5", or **KLK1 5 
Protein". The gene encoding the protein is referred to as "klkl5'\ 
35 Broadly stated the present invention relates to an isolated nucleic acid molecule of at least 30 

nucleotides which hybridizes to one or more of SEQ. ID. NO. 1 through 5, or 10 through 24, or the 
complement of one or more of SEQ ID NO. 1 through 5, or 10 through 24 under stringent hybridization 
conditions. 

The invention also contemplatesa nucleic acid molecule comprising a sequence encoding a truncation 
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of a KLK15 Protein, an analog, or a ho'molog of a KLK15 Protein or a truncation thereof. (KLK15 Protein 
and truncations, analogs and homologs of KLK15 Protein are also collectively referred to hwein as " KLK15 
Related Proteins"). 

The nucleic acid molecules of the invention may be inserted into an appropriate expression vector, 
5 i.e. a vector that contains the necessary elements for the transcription and translation of the inserted coding 
sequence. Accordingly, recombinant expression vectors adapted for transformation of a host cell may be 
constructed which comprise a nucleic acid molecule of the invention and one or more transcription and 
translation elements linked to the nucleic acid molecule. 

The recombinant expression vector can be used to prepare transformed host cells expressing KLK15 
1 0 Related Proteins. Therefore, the invention further provides host cells containing a recomibinant molecule of the 
invention. The invention also conten^lates transgenic non-human mammals whose germ cells and somatic cells 
contain a recombinant molecule comprising a nucleic acid molecule of the invention, in particular one which 
encodes an analog of tiie KLK15 Protein, or a truncation of tiie KLK15 Protein. 

The invention further provides a method for preparing KLK15 Related Proteins utilizing the purified 
15 and isolated nucleic acid molecules of the invention. In an embodiment a method for preparing a KLK15 
Related Protein is provided comprismg (a) transferring a recombinant expression vector of tiie invention into 
a host cell; (b) selecting transformed host cells from untransformed host cells; (c) culturing a selected 
transformed host cell under conditions which allow expression of the KLK15 Related Protein; and (d) isolating 
tiie KLK15 Related Protein. 

2 0 The invention fiirtiier broadly contemplates an isolated KLK15 Protein conq)rising an amino acid 

sequenceofSEQ.ID.NO. 6, 7, 8, or 9. 

The KLKI5 Related Proteins of the invention may be conjugated with otiier molecules, such as 
proteins, to prepare fusion proteins. This may be accomplished, for example, by tiie synttiesis of N-terminal 
or C-terminal fusion proteins. 
25 The invention further conten^lates antibodies having specificity against an epitope of a KLK15 

Related Protein of tiie invention. Antibodies may be labeled witii a detectable substance and used to detect 
proteins of the invention in tissues and cells. Antibodies may have particular use in therapeutic applications, 
for example to react with tumor cells, and in conjugates and inmmnotoxins as target selective carriers of various 
agents which have antitumor effects including chemotherapeutic drugs, toxins, immunological response 

3 0 modifiers, enzymes, and radioisotopes. 

The invention also permits tiie construction of nucleotide probes that are unique to the nucleic acid 
molecules of the invention and/or to proteins of tiie invention. Therefore, tiie invention also relates to a probe 
comprising a nucleic acid sequence of the invention, or a nucleic acid sequence encoding a protein of tiie 
invention, or a part tiiereof . The probe may be labeled, for example, with a detectable substance and it may be 
3 5 used to select from a mixture of nucleotide sequences a nucleic acid molecule of tiie invention including nucleic 
acid molecules coding for a protein which displays one or more of tiie properties of a protein of tiie invention. 
A probe may be used to mark tumors. 
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The invention dso provides antisense nucleic acid molecuies e.g. by production of a nJRNA orDNA 
strand in the reverse orientation to a sense molecule. An antisense nucleic acid molecule naay be used to 
suppress the growth of a KLK15 expressing (e.g. cancerous) cell. 

The invention still further provides a method for identifying a substance that binds to a protein of the 
5 invention comprising reacting the protein with at least one substance which potentially can bind witii the 
protein, under conditions which permit the formation of complexes between the substance and protein and 
detecting binding. Binding may be detected by assaying for con^lexes, for free substance, or for non- 
complexed protein. The invention also contemplates methods for identifying substances that bind to other 
intracellular proteins that interact witii a KLK15 Related Protein. Methods can also be utilized which identify 
1 0 compounds which bind to KI£1S gene regulatory sequences (e.g. promoter sequences). 

Still further the invention provides a method for evaluating a compound for its ability to modulate the 
biological activity of a KLK15 Related Protein of the invention. For example a substance which inhibits or 
enhances the interaction of die protein and a substance which binds to the protein may be evaluated. In an 
embodiment, the method comprises providing a known concentration of a KLK15 Related Protein, with a 
15 substance which binds to the protein and a test compound under conditions which permit the formation of 
complexes between the substance and protein, and removing and/or detecting complexes. 

Compounds which modulate the biological activity of a protein of the invention may also be identified 
using the methods of the invention by comparing the pattern and level of expression of the protein of the 
invention in tissues and cells, in the presence, and in the absence of the compounds. 
2 0 The proteins of the invention, antibodies, antisense nucleic acid molecules, and substances and 

compounds identified using the methods of the invention, and peptides of the invention may be used to 
modulate the biological activity of a KLK15 Related Protein of the invention, and they may be used in the 
treatment of conditions such as cancer (particularly prostate, colon, kidney, and testicular cancer) and thyroid 
disorders in a subject Accordingly, the substances and confounds may be formulated into compositions for 

2 5 admioistration to individuals suffering from disorders such as cancer (particularly particularly prostate, colon, 

kidney, and testicular cancer) and thyroid disorders in a subject In particular, the antibodies, antisense nucleic 
acid molecules, substances and compounds may be used to treat patients who have a KLK15 Related Protein 
in, or on, tiieir cancer cells. 

Therefore, the present invention also relates to a composition comprising one or more of a protein of 
30 tiie invention, or a substance or compound identified using the methods of the invention, and a 
pharmaceutically acceptable carrier, excipient or diluent. A method for treating or preventing a disorder such 
as cancer (particularly prostate, thyroid, colon, kidney, and testicular cancer) and thyroid disorders in a subject 
is also provided comprising administering to a patient in need thereof, a KLK15 Related Protein of the 
invention, a substance or compound identified using the methods of the invention, or a composition of the 

3 5 invention. 

Another aspect of the invention is the use of a KLK15 Related Protein, peptides derived therefrom, 
or chemically produced (synthetic) peptides, or any combination of diese molecules, for use in the preparation 
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of vaccines to prevent cancer and/or to treat cancer, in particular to prevent and/or treat cancer in patients who 
have a KLK15 Related Protein detected on their cells. These vaccine preparations may also be used to prevent 
patients from having tumors prior to their occurrence. 

The invention broadly contemplates vaccines for stimulating or enhancing in a subject to whom the 
5 vaccine is administered production of antibodies directed against a KLK15 Related Protein. 

The invention also provides a method for stimulating or enhancing in a subject production of 
antibodies directed against a KLK15 Related Protein. The method comprises adnunisteiing to the subject a 
vaccine of the invention in a dose effective for stimulating or enhancing production of the antibodies. 

The mvention further provides methods for treating, preventing, or delaying recurrence of cancer. The 
10 methods comprise administering to the subject a vaccine of the invention in a dose effective for treating, 
preventing, or delaying recurrence of cancer. 

In other embodiments, the invention provides a method for identifying inhibitors of a KLK15 Related 
Protein interaction, comprising 

(a) providing a reaction mixture including the KLK15 Related Protein and a substance that binds 
15 to die KLK15 Related Protein, or at least a portion of each which interact; 

(b) contacting the reaction mixture with one or more test compounds; 

(c) identifying conq)ounds which inhibit the interaction of the KLK15 Related Protein and 
substance. 

In certain preferred embodiments, the reaction mixture is a whole cell. In other embodiments, the 
2 0 reaction mixture is a cell lysate or purified protein composition. The subject method can be carried out usmg 
libraries of test compounds. Such agents can be proteins, peptides, nucleic acids,. carbohydrates, small organic 
molecules, and natural product extract libraries, such as those isolated from animals, plants, fungus and/or 
microbes. Still another aspect of the present invention provides a method of conductmg a drug 

discovery business comprising: 

2 5 (a) providing one or more assay systems for identifying agents by their ability to inhibit or 

potentiate the interaction of a KLK15 Related Protein and a substance that binds to the protein; 

(b) conducting therapeutic profiling of agents identified in stsp (a), or further analogs tiiereof, 
for efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical preparation including one or more agents identified in step (b) 
30 as having an acceptable therapeutic profile. 

In certain embodiments, the subject method can also include a step of establishing a distribution 
system for distributing the pharmaceutical preparation for sale, and may optionally include establishing a sales 
group for marketing the pharmaceutical preparation. 

Other objects, features and advantages of the present invention will become apparent from the 

3 5 following detailed description. It should be understood, however, that the detailed description and the specific 

examples while indicating prefixed embodiments of the invention are given by way of illustration only, since 
various changes and modifications witiiin the spirit and scope of the invention will become apparent to those 
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skUled in the art from this detailed description. ' ' . 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the drawings in which: 

Figure 1 shows the genomic organization and partial genomic sequence of the KLK15 gene. Intronic 
5 sequences are not shown except for the splice junction areas. Introns are shown with lower case letters and 
exons with capital letters. The coding nucleotides are shown in bold and the 3 ' untranslated region follows the 
TGA stop codon (encircled). The translated amino acids of the coding region are shown underneath by a single 
letter abbreviation. The start and stop codons are encircled and the exon -intron junctions are underlined. The 
catalytic residues are boxed. The putative polyadenylation signal is underlined. The exact start of the first 

1 0 coding exon was not determined. 

Figure 2 shows an alignment of the deduced amino acid sequence of KLK15 with members of the 
kallikrein multi-gene family (SEQ ID NOs. 25-38). Dashes represent gaps to bring the sequences to better 
alignment The residues of the catalytic triad (H, D, S) are shown in italics. Identical amino acids are 
highlighted in black and shnilar residues in grey. The 29 invariant serine protease residues are marked by ( • ) 

15 on die bottom, and the cysteine residues by (+) on top of each block. The predicted cleavage sites of the signal 
and activation peptides are indicated by arrows. The dotted area represents the kallikrein loop sequence. The 
trypsin-like cleavage pattern predicted by the presence of the **D" residue is indicated by (*). KLK15 has an 
•*E" in Ais position. A unique 8 amino acid sequence, HNEPGTAG (SEQ ID NO. 10), is present at positions 
148-155 of the KLK15 gene. 

2 0 Figure 3 is a plot of hydrophobicity and hydrophilicity of the KLK15 protein, as compared with the 

prostate specific antigen (PSA). Note the hydrophobic region at the amino terminus, suggesting presence of 
a signal peptide. 

Figure 4 is a dendrogram of the predicted phylogenetic tree for 15 kallikreins and a few other serine 
proteases. The ndghbor-joining method was used to align KLK15 with other serine proteases and members 
25 of the kallikrein gene family. The tree grouped the classical kallikreins (hKl, hK2, and PSA) together and 
aligned KLK15 m one group with TLSP and KLK-L3 genes. Other serine proteases were aligned in different 
groups, as shown. KLK represents kallikrein; KLK-L represents kallikrein-like; TLSP represents tiypsin-like 
SCTine protease; NESl represents normal epithelial cell-specific gene; PSA represents prostate specific antigen; 
hKl and hK2 represents human glandular kallikrein 1 and 2, respectively; and HSCXIE represents human 

3 0 stratum comeum chymotryptic enzyme. 

Figure 5 is a schematic presentation of the different splice variants of the KLK15 gene. Exons are 
shovra by boxes and introns by the coimecting lines. Numbers inside boxes represent the exon lengths in base 
pairs. The arrowhead points to the common start codon and stars to the stop codon positions. The length of the 
predicted polypeptide product is indicated beside each variant in amino acids (AA). The alternative splicing 
3 5 and/or exon skips create a frame shift, which leads to a premature termination. 

Figure 6 shows the relative locations of KLKl, KLK15, and KLK3 genes on chromosome 19ql3.3- 
ql3.4. The two overlapping BAG clones are identified, and the overlap region is hatched. Genes are 
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represented by horizontal arrows denoting the direction of the coding sequence. Distances between genes are 
mentioned in base pairs. Figure is not drawn to scale. 

Figure 7 shows tissue expression of the KLK15 gene, as determined by RT-PCR, KLK15 is primarily 
expressed in the thyroid gland» and to a lower extent in the prostate, salivary and adrenal glands, colon, testis 
5 and kidney. M= Molecular weight marker. For explanation of the multiple PGR bands (alternatively spliced 
forms) see the Example. PGR was performed with primers KLK15-F2 and KLK15-RL 

Figure 8 shows honnonal regulation of tiie KLK15 gene in tiie LNCaP prostate cancer cell line. DHT 
= dihydiotestosterone. Steroids were added at 10"* M final concentrations, (-ve) = negative control. Actin was 
used as a control gene. 

1 0 Figure 9 is a schematic diagram showing the comparison of the coding regions of the 15 kallikrein 

genes. Exons are shown by solid bars and introns by the connecting lines. Letters above boxes indicate relative 
positions of the catalytic triad that was found to be conserved in all genes; H denotes histidine, D aspartic acid 
and S serine. Roman numbers indicate intron phases. The intron phase refers to the location of the intron 
within die codon; I denotes tiiat the intron occurs after the first nucleotide of the codon, n the intron occurs 

1 5 after die second nucleotide, 0 the intron occurs between codons. The intron phases are conserved in all genes. 
Numbers inside boxes indicate exon lengths in base pairs. Names inside brackets represent the ofticiai 
nomenclature approved by the human gene nomenclature committee. Untranslated 3' and 5' regions and 5' 
untranslated exons are not shown. 
DETAILED DESCRIPTION OF THE INVENTION 

20 In accordance with the present invention there may be employed conventional molecular biology, 

microbiology, and recombinant DNA techniques within the skill of die art. Such techniques are explained fully 
in die literature. See for example, Sambrook, Fritsch, & Maniatis, Molecular Cloning: A Laboratory Manual, 
Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y); DNA Cloning: A 
Practical Approach, Volumes I and n (D.N. GIovct ed. 1985); Oligonucleotide Synthesis (M. J. Gait ed. 1984); 

2 5 Nucleic Acid Hybridization B JD. Hames & S. J. BSggins eds, (1985); Transcription and Translation B.D. Hames . 

& S J. Ifiggins eds (1984); Animal Cell Culture RJ. Freshney, ed. (1986); Immobilized Cells and enzymes IRL 
Press. (1986); and B, P^bal, A Practical Guide to Molecular Cloning (1984). 
1. Nucleic Add Molecules of the Invention 

As hereinbefore mentioned, the invention provides an isolated nucleic acid molecule having a 

3 0 . sequence encoding a KLK15 Protein. The term "isolated" refers to a nucleic acid substantially free of cellular 

material or culture medium when produced by recombinant DNA techniques, or chemical reactants, or other 
chemicals when chemically synthesized. An "isolated" nucleic acid may also be free of sequences which 
naturally flank die nucleic acid (i.e., sequences located at the 5* and 3* ends of die nucleic acid molecule) from 
which die nucleic acid is derived. The term "nucleic acid" is intended to include DNA and RNA and can be 
3 5 either double stranded or single stranded. In an embodiment, a nucleic acid molecule of the invention encodes 
a protein comprising an amino acid sequence of SEQ.ID J^O. 6, 7, 8, or 9 preferably a nucleic acid molecule 
of die invention comprises a nucleic acid sequence of one or more of SEQ.ID JTO. 1 tiarough 5. or 10 through 
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24. 

In an embodiment, the invention provides an isolated nucleic acid molecule which comprises: 

(i) a nucleic acid sequence encoding a protein having substantial sequence identity with an 
amino acid sequence of SEQ. ID. NO. 6, 7, 8, or 9; 

5 (ii) a nucleic acid sequence encoding a protein comprising an amino acid sequence of SEQ. ED. 

NO. 6, 7, 8, or 9; 

(iii) nucleic acid sequences complementary to (i) or (ii); 

(iv) a degenwate form of a nucleic acid sequence of (i) or (ii); 

( v) a nucleic add sequence capable of hybridizing under stringent conditions to a nucleic acid 
1 0 sequence in (i), (ii) or (iii); 

(vi) a nucleic acid sequence encoding a truncation, an analog, an allelic or species variation of 
a protein comprising an amino acid sequence of SEQ. ID. NO. 6, 7, 8, or 9; or 

(vii) a fragment, or allelic or species variation of (i), (ii) or (iii). 

Preferably, a purified and isolated nucleic acid molecule of the invention comprises: 
15 (i) a nucleic acid sequence comprising the sequence of one or more of SEQ.ID JNfO. 1 through 

5 or 10 through 24,wherein T can also be U; 

(ii) nucleic acid sequences complonentary to (i), preferably complementary to the full nucleic 
acid sequence of one or more of SEQ.ID J^O. 1 through 5 or 10 through 24; 

(iii) a nucleic acid capable of hybridizing under stringent conditions to a nucleic add of (i) or (ii) 
2 0 and preferably having at least 18 nucleotides; or 

(iv) a nucleic acid molecule differing from any of the nucleic acids of (i) to (iii) in codon 
sequences due to the degeneracy of the genetic code. 

The invention includes nucleic acid sequences complementary to a nucleic acid encoding a protein 
comprising an amino add sequence of SEQ.ID.NO, 6, 7, 8. or 9 preferably the nucleic acid sequences 

2 5 complementary to a full nucldc acid sequence of one or more of SEQ,IDJNO, I through 5 or 10 through 24. 

The invention includes nucleic acid molecules having substantial sequence identity or homology to 
nucleic acid sequences of the invention or encoding proteins having substantial identity or similarity to the 
amino acid sequence of SEQ.ID.NO. 6, 7, 8. or 9. Prejferably, the nucleic acids have substantial sequence 
identity for example at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, or 85% nucleic acid identity; more 

3 0 preferably 90% nucleic acid identity; and most preferably at least 95%, 96%, 97%, 98%, or 99% sequence 

identity. "Identity" as known in the art and used herein, is a relationship between two or more amino acid 
sequences or two or more nucleic acid sequences, as determined by comparing the sequences. It also refers to 
the degree of sequence relatedness between amino acid or nucleic acid sequences, as the case may be, as 
determined by the match between strings of such sequences. Identity and similarity are well known terms to 
3 5 skilled artisans and they can be calculated by conventional methods (for example see Computational Molecular 
Biology, Lesk. AM. ed., Oxford University Press, New York, 1988; Biocomputing: Infoimatics and Genome 
Projects. Smith, D.W. ed.. Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I. 
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Griffin, A.M. and Griffin, H.G. eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular 
Biology, von Heinje, G. Acadmeic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 
J. eds. M. Stockton Press, New York, 1991, Carillo, H. and Lipman, D.. SIAM J. Applied Math. 48:1073, 
1988). Methods which are designed to give the largest match between the sequences are generally preferred. 
5 Methods to determine identity and similarity are codified in pubhcly available computer programs including 
the GCG program package (Dev^eux J. et al.. Nucleic Acids Research 12(1): 387, 1984); BLASTP, BLASTN, 
and FASTA (Atschul, S.F. et al. J. Molec. Biol. 215: 403-410, 1990). The BLAST X program is publicly 
available ftom NCBI and other sources (BLAST Manual, Altschul, S. et al, NCBI NLM NIH Bethesda, Md. 
20894; Altschul, S. et al. J. Mol. Biol. 215: 403-410. 1990). 

1 0 Isolated nucleic acid molecules encoding a KLK15 Protein, and having a sequence which differs from 

a nucleic acid sequence of the invention due to degeneracy in the genetic code are also within the scope of the 
invention. Such nucleic acids encode functionally equivalent proteins (e.g. a KLK15 Protein) but differ in 
sequence from the sequence of a KLK15 Protein due to degeneracy in the genetic code. As one example, DNA 
sequence polymorphisms within the nucleotide sequence of a KLK15 Protein may result in silent mutations 

15 which do not affect the amino acid sequence. Variations in one or more nucleotides may exist among 
individuals within a population due to natural allelic variation. Any and all such nucleic acid variations are 
within the scope of the invention. DNA sequence polymorphisms may also occur which lead to changes in the 
amino acid sequence of a KLK15 Protein. These amino acid polymorphisms are also within the scope of the 
present mvration. 

2 0 Another aspect of the invention provides a nucleic acid molecule which hybridizes under stringent 

conditions, preferably high stringency conditions to a nucleic acid molecule which comprises a sequence which 
encodes a KLK15 Protein having an amino acid sequence shown in SEQ.ID.NO. 6, 7, 8, or 9. Appropriate 
stringency conditions which promote DNA hybridization are known to those skilled in the art, or can be found 
in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, 6.0 x 

2 5 sodium chloride/sodium citrate (SSC) at about 45**C. followed by a wash of 2.0 x BSC at 50°C may be 

employed. The stringency may be selected based on the conditions used in the wash step. By way of example, 
the salt concentration in the wash step can be selected from a high stringency of about 0.2 x SSC at 50**C. In 
addition, die temperature in the wash step can be at higji stringency conditions, at about 65°C. 

It will be appreciated that the invention includes nucleic acid molecules encoding a KLK15 Related 

3 0 Protein including truncations of a KLK15 Protein, and analogs of a KLK15 Protein as described herein. The 

truncated nucleic acids or nucleic acid fragments may correspond to a sequence comprising or consisting of 
nucleotides 1581-1623, 1524-5258, 5259-5412, 5413-5912, 5913-6078, 6197-6316, 6079-6316, 6317-6453, 
6454-7126, 6079-7126. 7127-7786. 5913-6196, 7127-7131. or 7127-7279 of SBQ ID NO. 1, or SEQ ID NO. 
39, 47, 48, 49, or 50. It will further be appreciated that variant forms of the nucleic acid molecules of the 
3 5 invention which arise by alternative splicing of an mRNA corresponding to a cDNA of the invention are 
encompassed by the invention (See SEQ ID NO. 3, 4, and 5). 

An isolated nucleic acid molecule of the mvention ^ch comprises DNA can be isolated by preparing 
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a labelled nucleic acid probe based on all or part of a nucleic acid sequence of the invention. The labeled 
nucleic acid probe is used to screen an appropriate DNA library (e.g. a cDNA or genomic DNA library). For 
example, a cDNA library can be used to isolate a cDNA encoding a KLK15 Related Protein by screening the 
library with the labeled probe using standard techniques. Alternatively, a genomic DNA library can be similarly 
5 screened to isolate a genomic clone encompassing a gene encoding a KLK15 Related Protein. Nucleic acids 
isolated by screening of a cDNA or genomic DNA library can be sequenced by standard techniques. 

An isolated nucleic add molecule of the invention which is DNA can also be isolated by selectively 
amplifying a nucleic acid encoding a KLK15 Related Protein using the polymerase chain reaction (PGR) 
methods and cDNA or genomic DNA. It is possible to design synthetic oligonucleotide primers from the 

1 0 nucleotide sequence of the invention for use in PGR. A nucleic acid can be amplified from cDNA or genomic 
DNA using these oligonucleotide primers and standard PGR amplification techniques. The nucleic acid so 
amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. cDNA may 
be prepared from mRNA, by isolating total cellular mRNA by a variety of techniques, for example, by using 
the guanidinium-thiocyanate extraction procedure of Ghirgwin et al., Biochemistry, 18, 5294-5299 (1979). 

1 5 cDNA is then synthesized from the mRNA using reverse transcriptase (for example, Moloney MLV reverse 
transcriptase available from Gibco/BRL, Bethesda, MD, or AMV reverse transcriptase available from 
Seikagaku America, Inc., St. Petersburg, EL). 

An isolated nucleic acid molecule of the invention whidi is RNA can be isolated by cbning a cDNA 
encoding a KLK15 Related Protem mto an appropriate vector which allows for transcription of the cDNA to 

2 0 produce an RNA molecule which encodes a KLK15 Related Protein. For example, a cDNA can be cloned 
downstream of a bacteriophage promoter, (e.g. a T7 promoter) in a vector, cDNA can be transcribed in vitro 
with T7 polymerase, and the resultant RNA can be isolated by conventional techniques. 

Nucleic acid molecules of the invention may be chemically synthesized using standard techniques. 
Methods of chemically synthesizing polydeoxynucleotides are known, including but not limited to solid-phase 

2 5 synthesis which, like peptide synthesis, has been My automated in commercially available DNA synthesizers 

(See e.g.. Itakm-a et al. U.S. Patent No. 4,598,049; Garuthcrs et al. U.S. Patent No. 4,458,066; and Itakura U.S. 
Patent Nos. 4.401,796 and 4.373,071). 

Determination of whether a particular nucleic acid molecule encodes a KLK15 Related Protein can 
be accomplished by expressing the cDNA in an appropriate host cell by standard techniques, and testing the 
30 expressed protein in the methods described herein. A cDNA encoding a KLK15 Related Protein can be 
sequenced by standard techniques, such as dideoxynucleotide chain termination or Maxam-Gilbert chemical 
sequencing, to determine the nucleic acid sequence and the predicted amino acid sequence of the encoded 
protein. 

The initiation codon and untranslated sequences of a KLK15 Related Protein may be determined using 

3 5 computer software designed for tiie purpose, such as PG/Gene (IntelliGenetics Inc., Galif.). The intron-exon 

structure and the transcription regulatory sequences of a gene encoding a KLK15 Related Protein may be 
confirmed by using a nucleic acid molecule of the invention encoding a KLKIS Related Protein to probe a 
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genomic DNA clone library. Regulatory elements can be identified using standard techniques. The function 
of the elements can be confirmed by using these elements to express a reporter gene such as the lacZ gene that 
is operatively linked to the elements. These constructs may be introduced into cultured cells using conventional 
procedures or into non-human transgenic animal models. In addition to identifying regulatory elements in DNA, 
5 such constructs may also be used to identify nuclear proteins interacting with the elements, using techniques 
known in the art. In an embodiment, regulatory sequences of a nucleic add molecule of the invention comprise 
the sequence of SEQ ID NO. 11. 

la a particular embodiment of the invention, the nucleic acid molecules isolated using the methods 
described herein are mutant KLKIS gene alleles. The mutant alleles may be isolated from individuals either 

1 0 known or proposed to have a genotype which contributes to the symptoms of a disorder mvolving a KLK15 
Related Protein. Mutant alleles and mutant allele products may be used in therapeutic and diagnostic methods 
described herein. For example, a cDNA of a mutant KLK15 gene may be isolated using PGR as described 
herein, and the DNA sequence of the mutant allele may be compared to the normal allele to ascertain the 
mutation(s) responsible for the loss or alteration of function of the mutant gene product. A genomic library can 

1 5 also be constructed using DNA from an individual suspected of or known to carry a mutant allele, or a cDNA 
library can be constructed using RNA from tissue known, or suspected to express the mutant allele. A nucleic 
acid encoding a normal KLK15 gene or any suitable fragment thereof, may then be labeled and used as a probe 
to identify the corresponding mutant allele in such libraries. Clones containing mutant sequences can be 
purified and subjected to sequence analysis. In addition, an expression library can be constrocted using cDNA 

2 0 from RNA isolated from a tissue of an individual known or suspected to express a mutant KLK15 allele. Gene 
products made by the putatively mutant tissue may be expressed and screened, for example using antibodies 
specific for a KLK15 Related Protein as described herein. Library clones identified using the antibodies can 
be purified and subjected to sequence analysis. 

The sequence of a nucleic acid molecule of the invention, or a fragment of the molecule, may be 

2 5 inverted relative to its normal presentation for transcription to produce an antisense nucleic add molecule. An 
antisense nucleic acid molecule may be constructed using chemical synthesis and enzymatic ligation reactions 
using procedures known in the art. 
2. Proteins of the Invention 

An amino acid sequence of a KLK15 Protein comprises a sequence as shown in SEQ.ID.NO. 6, 7, 

30 8, or 9. The protein is primarily expressed in the thyroid gland, and to a lower extent in the prostate, saHvary 
and adrenal glands, colon, testis, and kidney. 

In addition to proteins comprising an amino acid sequence as shown in SEQ.ID.NO. 6, 7, 8, or 9 the 
protems of the present invention include truncations of a KLK15 Protein, analogs of a KLK15 Protein, and 
proteins having sequence identity or similarity to a KLK15 Protein^ and truncations thereof as described 

35 herein (i.e. KLK15 Related Proteins). 

Truncated proteins may comprise peptides of between 3 and 70 amino acid residues, ranging in size 
from a tripeptide to a 70 mer polyp^tide. In a preferred embodiment the peptide is HNEPGTAG (SEQ ID NO 
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10). The truncated proteins may have an amino group (-NH2), a hydrophobic group (for example, 
carbobenzoxyl, dansyl, or T-butyloxycarbonyl), an acetyl group, a 9-fluorenyknethoxy-carbonyl (PMOC) 
group, or a macromolecule including but not limited to lipid-fatty acid conjugates, polyethylene glycol, or 
carbohydrates at the amino terminal end. The truncated proteins may have a carboxyl group, an amido group, 
5 a T-butyloxycarbonyl group, or a macromolecule including but not limited to lipid-fatty acid conjugates, 
polyethylene glycol, or carbohydrates at the carboxy terminal end. 

The proteins of the invention may also include analogs of a KLX15 Protein, and/or truncations thereof 
as described herein, which may include, but are not limited to a KLKIS protein, containing one or more amino 
acid substitutions, insertions, and/or deletions. Amino acid substitutions may be of a conserved or non- 

1 0 conserved nature. Conserved amino acid substitutions involve replacing one or more amino adds of a KLK15 
Protein amino acid sequence with amino acids of similar charge, size, and/or hydrophobicity characteristics. 
When only conserved substitutions are made the resulting analog is preferably functionally equivalent to a 
KLK15 Protein. Non-conserved substitutions involve replacing one or more amino acids of the KLK15 Protein 
amino acid sequence with one or more amino acids that possess dissimilar charge, size, and/or hydrophobicity 

15 characteristics. 

One or more amino acid insertions may be introduced into a KLK15 Protem. Amino acid insertions 
may consist of single audno acid residues or sequential amino acids ranging from 2 to 15 amino adds in length. 

Deletions may consist of the removal of one or more amino adds, or discrete portions from a KLK15 
Protein sequence. The deleted amino adds may or may not be contiguous. The lower Umit length of the 
2 0 resulting analog with a deletion mutation is about 10 ammo acids, preferably 20 to 40 amino acids. 

The proteins of the invention include proteins with sequence identity or similarity to a KLK15 Protein 
and/or truncations thereof as described herein. Such KLKIS Proteins include proteins whose amino acid 
sequences are comprised of the amino acid sequences of KLK15 Protein regions from other species that 
hybridize under selected hybridization conditions (see discussion of stringent hybridization conditions herein) 

2 5 with a probe used to obtain a KLK15 Protem. These proteins will generally have the same regions which are 

characteristic of a KLKIS Protein. Preferably a protein will have substantial sequence identity for example, 
about S5%, 60%, 65%, 70%. 7S%, 80%, or 85% identity, preferably 90% identity, more preferably at least 
95%, 96%, 97%, 98%, or 99% identity, and most preferably 98% identity with an amino acid sequence of 
SEQ.ID.no. 6, 7, 8, or 9. A percent amino acid sequence homology, similarity or identity is calculated as the 

3 0 percentage of aligned amino acids that match the reference sequence using known methods as described herein. 

The invention also contemplates isoforms of the proteins of the invention. An isoform contains the 
same number and kinds of amino acids as a protein of the invention, but the isoforai has a different molecular 
structure. Isoforms conteQ^}lated by the present invention preferably have the same properties as a protein of 
the invention as described herein. 
3 5 The present invention also includes KLKIS Related Proteins conjugated with a selected protein, or 

a marker protein (see below) to produce fusion protems. Additionally, immunogenic portions of a KLKIS 
Protein and a KLKIS Protein Related Protein are within the scope of the invention. 
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A KLK15 Related Protein of the invention may be prepared using recombinant DNA methods. 
Accordingly, the nucleic acid molecules of the present invention having a sequence which encodes a KLK15 
Related Protein of the invention may be incorporated in a known mamier into an appropriate expression vector 
which ensures good expression of the protein. Possible expression vectors include but are not limited to 
5 cosmids, plasmids, or modified viruses (e.g. replication defective retroviruses, adenoviruses and adeno- 
associated viruses), so long as the vector is compatible with the host cell used. 

The invention therefore contemplates a recombinant expression vector of the invention containing a 
nucleic acid molecule of the invention, and the necessary regulatory sequences for the transcription and 
translation of the inserted protein-sequence. Suitable regulatory sequences may be derived from a variety of 

10 sources, including bacterial, fungal, viral, mammalian, or insect genes [For example, see the regulatory 
sequences described in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, 
San Diego, CA (1990)]. Selection of appropriate regulatory sequences is dependent on the host cell chosen as 
discussed below, and may be readily accomplished by one of ordinary skill in the art. The necessary regulatory 
sequences may be supplied by the native KLK15 Protein and/or its flanking regions. 

15 The invention further provides a recombinant expression vector comprising a DNA nucleic acid 

molecule of the invention cloned into the expression vector in an antisense orientation. That is, the DNA 
molecule is linked to a regulatory sequence in a manner which allows for expression, by transcription of the 
DNA molecule, of an RNA molecule which is antisense to the nucleic acid sequence of a protein of the 
invention or a fragment thereof. Regulatory sequences linked to the antisense nucleic acid can be chosen which 

2 0 direct the continuous expression of the antisense RNA molecule in a variety of cell types, for instance a viral 

promoter and/or enhancer, or regulatory sequences can be chosen which direct tissue or cell type specific 
expression of antisense RNA. 

The recombinant expression vectors of the invention may also contain a marker gene which facilitates 
the selection of host cells transformed or transfected with a recombinant molecule of the invention. Examples 
25 of marker genes are genes encoding a protein such as G418 and hygromycin ^ch confer resistance to certain 
drugs, P-galactosidase, chloran:qphenicol acetyltransferase, firefly lucif<^e, or an immunoglobulin or portion 
thereof such as tiie Fc portion of an immunoglobulin, prefi^^ly IgG. The markers can be introduced on a 
separate vector from the nucleic acid of interest 

The recombinant expression vectors may also contain genes that encode a fusion moiety which 

3 0 provides increased expression of the recombinant protein; increased solubility of the recombinant protein; and 

aid in the purification of the target recombinant protein by acting as a ligand in affinity purification. For 
example, a proteolytic cleavage site may be added to the target recombinant protein to allow separation of the 
recombinant protein from the fusion moiety subsequent to purification of the fusion protein. Typical fusion 
expression vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL (New England Biolabs, 
3 5 Beverly, MA) and pRTTS (Pharmacia, Piscataway, NJ) which fiise glutathione S-transferase (GST), maltose 
E binding protein, or protein A, respectively, to die recombinant protein. 

The recombinant expression vectors may be introduced into host cells to produce a transformant host 
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cell. "Transfonnant host cells" include host cells which have been transformed or transfected witib a 
recombinant expression vector of the invention. The terms "transformed with", "transfected with", 
"transformation" and "transfection" encompass the introduction of a nucleic acid (e.g. a vector) into a cell by 
one of many standard techniques. Prokaryotic cells can be transformed with a nucleic acid by, for example, 
5 electroporation or calcium-chloride mediated transfoimation. A nucleic acid can be introduced into mammalian 
cells via conventional techniques such as calcium phosphate or calcium chloride co-precipitation, DEAE- 
dextran-mediated transfection, lipofectin, electroporation or microinjection. Suitable methods for transfonning 
and traosfecting host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd 
Edition, Cold Spring Harbor Laboratory press (1989)), and other laboratory textbooks. 

1 0 Suitable host cells include a wide variety of prokaryotic and eukaryotic host cells. For example, the 

proteins of the invention may be expressed in bacterial cells such as E. colU insect cells (using baculovirus), 
yeast cells, or mammalian cells. Other suitable host cells can be found in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1991). 

A host cell may also be chosen which modulates the expression of an inserted nucleic acid sequence, 

15 or modifies (e.g. glycosylation or phosphorylation) and processes (e.g. cleaves) the protein in a desired fashion. 
Host systems or cell lines may be selected which have specific and characteristic mechanisms for post- 
translational processing and modification of protems. For example, eukaryotic host cells including CHO, 
VERO, BHK. HeLA, COS, MDCK, 293, 3T3, and WI38 may be used. For long-term high-yield stable 
expression of the protem, cell lines and host systems which stably express the gene product may be engineered. 

2 0 Host cells and in particular cell lines produced using the methods described herein may be particularly 

useful in screening and evaluating compounds that modulate the activity of a KLK15 Related Protein. 

The proteins of the invention may also be expressed in non-human transgenic animals including but 
not limited to mice, rats, rabbits, guinea pigs, micro-pigs, goats, sheep, pigs, non-human primates (e.g. baboons, 
monkeys, and chimpanzees) [see Hammer et al. (Nature 315:680-683, 1985), Palmiter et aL (Science 222:809- 

2 5 814. 1983). Brinster et al. (Free Nad. Acad. Sci USA 82:44384442. 1985), Palmiter and Brinster (CeU. 41:343- 

345, 1985) and U.S. Patent No. 4,736,866)]. Procedures known in the art may be used to introduce a nucleic 
acid molecule of the invention encoding a KLK15 Related Protein into animals to produce die founder lines 
of transgenic animals. Such procedures include pronuclear micromjection, retrovirus mediated gene transfer 
into germ lines, gene targeting in mbiyonic stem cells, electroporation of embryos, and sperm-mediated gene 

3 0 transfer. 

The present invention contemplates a transgenic animal that carries the KLK15 gene in all then* cells, 
and animals which carry the transgene in some but not all their cells. The transgene may be integrated as a 
single transgene or in concatamers. The transgene may be selectively introduced into and activated in specific 
cell types (See for exan^le, Lasko et al, 1992 Proc. Natl. Acad. Sci. USA 89: 6236). The transgene may be 
3 5 integrated into the chromosomal site of the endogenous gene by gene targeting. The transgene may be 
selectively introduced into a particular ceU type inactivating the endogenous gene in that cell type (See Gu et 
al Science 265: 103-106). 
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The expression of a recombinant KLK15 Related Protein in a transgenic animal may be assayed using 
standard techniques. Initial screening may be conducted by Southern Blot analysis, or PGR methods to analyze 
whether the transgene has been integrated. The level of mRNA expression in the tissues of transgenic animals 
may also be assessed using techniques including Northern blot analysis of tissue samples, in situ hybridization, 
5 and RT-PCR. Tissue may also be evaluated immunocytochemically using antibodies against KLK15 Protein. 

Proteins of the invention may also be prepared by chemical synthesis using techniques well known 
in the chemistry of proteins such as solid phase synthesis (Merrifield, 1964, J. Am. Chem. Assoc. 85:2149- 
2154) or synthesis in homogenous solution (Houbenwejd, 1987» Methods of Organic Chemistry, ed. E. 
Wansch, Vol. 15 1 and H, Thieme. Stuttgart). 

10 N-terminal or C-terminal fusion proteins comprising a KLK15 Related Protein of the invention 

conjugated with other molecules, such as proteins, may be prepared by fusing, through recombinant techniques, 
the N-terminal or C-terminal of a KLK15 Related Protein, and the sequence of a selected protein or marker 
protein with a desired biological function. The resultant fusion proteins contain a KLK15 Protein fused Co the 
selected protein or marker protein as described herein. Examples of proteins which may be used to prepare 

1 5 fusion proteins include immunoglobulins, giutathione-S-transferase (GST), hemagglutinin (HA), and truncated 
myc. 

3. Antibodies 

KLK15 Related Proteins of the invention can be used to prepare antibodies specific for the proteins. 
Antibodies can be prepared which bind a distinct epitope in an unconsorved region of the protein. An 
2 0 unconserved region of the protein is one that does not have substantial sequence homology to other proteins. 
A region from a conserved region such as a well-characterized domain can also be used to prepare an antibody 
to a conserved region of a KLK15 Related Protein. Antibodies having specificity for a KLK15 Related 
Protein may also be raised from fusion proteins created by expressing fusion proteins in bacteria as described 
herein. 

2 5 The invention can employ intact monoclonal or polyclonal antibodies, and immunologically active 

fragments (e.g. a Fab, (Fab)2 fragment, or Fab expression library fragments and epitope-binding fragments 
thereof), an antibody heavy chain, and antibody light chain, a genetically ^gme«^ single chain Fv molecule 
(Ladner et al, U.S. Pat. No. 4,946,778), humanized antibody, or a chimeric antibody, for example, an antibody 
which contains the binding specificity of a murine antibody, but in which the remaining portions are of human 

3 0 origin. Antibodies including monoclonal and polyclonal antibodies, fragments and chimeras, may be prepared 

using methods known to those skilled in the art. 

4. Applications of the Nucleic Add Molecules. KLKIS Related Proteins, and Antibodies of the 
Invention 

The nucleic acid molecules, KLK15 Related Protdns, and antibodies of the invention may be used 
35 in the prognostic and diagnostic evaluation of disorders involving a KLK15 Related Protein (e.g. cancer or 
thyroid disorders), and the identification of subjects with a predisposition to such disorders (Section 4.1.1 and 
4.1.2). 
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In an embodiment of the invention, a method is provided for detecting the expression of the cancer 
marker KLK15 in a patient comprising: 

(a) taking a sample derived from a patient; and 

(b) detecting in the sample a nucleic acid sequence encoding KLK15 or a protein product encoded 
5 by a KLK15 nucleic acid sequence. 

In a particular embodiment of the invention, the nucleic add nx>lecules, KLK15 Rdated Proteins, and 
antibodies of the invention may be used in the diagnosis and staging of cancer, in particular prostate cancor. 
Increased levels of KLK15 Related Proteins are associated with more aggressive forms of prostate cancer and 
may be an indicator of poor prognosis. 

1 0 Methods for detecting nucleic acid molecules and KLK15 Related Proteins of the invention, can be 

used to monitor disorders involving a KLK15 Related Protein by detecting KLK15 Related Proteins and 
nucleic acid molecules encoding KLK15 Related Proteins. The applications of the present invention also 
include methods for the identification of compounds that modulate the biological activity of KLK15 Related 
Proteins (Section 4.2), The compounds, antibodies etc. may be used for the treatment of disorders involving 

15 a KLK15 Related Protein (Section 4.3). It would also be apparent to one skilled in the art that the methods 

r 

described herein may be used to study the developmental expression of KLK15 Related Proteins and, 
accordingly, will provide further insight into the role of KLK15 Related Proteins. 
4.1 Diagnostic Methods 

A variety of methods can be employed for the diagnostic and prognostic evaluation of disorders 
2 0 involving a KLK15 Related Protein, and the identification of subjects with a predisposition to such disorders. 
Such methods may, for example, utilize nucleic acid molecules of the invention, and fragments thereof, and 
antibodies directed against KLK15 Related Proteins, including peptide fragments. In particular, the nucleic 
acids and antibodies may be used, for example, for: (1) the detection of the presence of KLK15 mutations, or 
the detection of either over- or under-expression of KLK15 mRNA relative to a non-disorder state or the 

2 5 qualitative or quantitative detection of alternatively spliced forms of KLK15 transcripts which may correlate 

with certain conditions or susceptibility toward such conditions; and (2) the detection of either an over- or an 
under-abundance of KLK15 Related Proteins relative to a non- disorder state or the presence of a modified 
(e.g., less than fiill length) KLK15 Protein which correlates with a disorder state, or a progression toward a 
disorder state. 

3 0 The methods described herein may be used to evaluate the probability of the presence of malignant 

or pre-malignant cells, for example, in a group of cells fi-eshly removed from a host Such methods can be used 
to detect tumors, quantitate their growth, and help in the diagnosis and prognosis of disease. The mediods can 
be used to detect the presence of cancer metastasis, as well as confirm the absence or removal of all tumor 
tissue foUowmg surgery, cancer chemotherapy, and/or radiation therapy. They can fiirther be used to monitor 
3 5 cancer chemotherapy and tumor reappearance. 

The methods described herein may be performed by utilizing pr&-packaged diagnostic kits comprising 
at least one specific KLKIS nucleic acid or antibody described herein, which may be conveniently used, e.g., 
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in clinical settings, to screen and diagnose patients and to screen and identify those individuals exhibiting a 
predisposition to developing a disorder. 

Nucleic acid-based detection techniques are described, below, in Section 4.1.1. Peptide detection 
techniques are described, below, in Section 4.1.2. The samples that may be analyzed using the methods of the 
5 invention include those which are known or suspected to express KLK15 or contain KLK15 Related Proteins. 
The samples may be derived from a patient or a cell culture, and include but are not limited to biological flmds, 
tissue extracts, freshly harvested cells, and lysates of cells which have been incubated in cell cultures. 

Oligonucleotides or longer fragments derived from any of the nucleic acid molecules of the invention 
may be used as targets in a microarray. The microarray can be used to simultaneously monitor the expression 
10 levels of large numbers of genes and to identify genetic variants, mutations, and polymorphisms. The 
information from the microarray may be used to determine gene function, to understand the genetic basis of 
a disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents. 

The preparation, use, and analysis of microarrays are well known to a person skilled in the art. (See, 
for example, Brennan, T. M. et al. (1995) U.S. Pat. No. 5,474,796; Schena, et al. (1996) Proc. Natl. Acad. Sci. 
15 93:10614-10619; Baldeschweil^ et al. (1995), PCT AppUcation W095/251 1 16; Shalon, D. et al. (1 995) PCT 
appUcation WO95/35505; Heller, R. A. et al. (1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, M, J. 
et al. (1997) U.S. Pat No. 5,605,662.) 

4.1.1 Methods for Detecttng Nucleic Add Moleailes off the Invention 

The nucleic acid molecules of the invention allow those skilled in the art to construct nucleotide 
2 0 probes for use in the detection of nucleic add sequences of the invention in samples. Suitable probes include 
nucleic acid molecules based on nucleic acid sequences encoding at least 5 sequential amino acids fix)m regions 
of the KLK15 Protein, preferably they comprise 15 to 30 nucleotides (see SEQ ID Nos. 47-50). A nucleotide 
probe may be labeled with a detectable substance such as a radioactive label which provides for an adequate 
signal and has sufficient half-life such as ^^P, ^H, ^^C or the like. Other detectable substances which n:iay be 

2 5 used include antigens that are recognized by a specific labeled antibody, fluorescent compounds, enzymes, 

antibodies specific for a labeled antigen, and luminescent compounds. An appropriate label may be selected 
having regard to the rate of hybridization and binding of die probe to the nucleotide to be detected and the 
amount of nucleotide available for hybridization. Labeled probes may be hybridized to nucleic acids on solid 
supports such as nitrocellulose filters or nylon membranes as generally described in Sambrook et al, 1989, 

3 0 Molecular Cloning, A Laboratory Manual (2nd ed.). The nucleic acid probes may be used to detect genes, 

preferably in human cells, that encode KLK15 Related Proteins. The nucleotide probes may also be useful in 
the diagnosis of disorders involving a KLK15 Related Protein; in monitoring the progression of such disorders; 
or monitoring a therapeutic treatment 

The probe may be used in hybridization techniques to detect genes that encode KLK15 Related 
3 5 Proteins. The technique generally involves contacting and incubatmg nucleic acids (e.g. recombinant DNA 
molecules, cloned genes) obtained from a sample from a patient or other cellular source with a probe of the 
present invention under conditions favorable for the specific annealing of the probes to complementary 
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sequences in the nucleic acids. After incubation, the non-annealed nucleic acids are removed, and the presence 
of nucleic acids that have hybridized to the probe if any are detected. 

The detection of nucleic acid molecules of the invention may involve the amplification of specific 
gene sequences using an amplification method such as PGR, followed by the analysis of the amplified 
5 molecules using techniques known to those skilled in the art Suitable primers can be routinely designed by one 
of skill in the art 

Genomic DNA maybe used in hybridization or amplification assays of biological samples to detect 
abnormalities involving KLKIS structure, including point mutations, insertions, deletions, and chromosomal 
rearrangements. For example, direct sequencing, smgle stranded confonnational polymorphism analyses, 
10 heteroduplex analysis, denaturing gradient gel electrophoresis, chemical mismatch cleavage, and 
oligonucleotide hybridization may be utilized. 

Genotyping techniques known to one skilled in the art can be used to type polymorphisms that are in 
close proximity to the mutations in a klkl5 gene. The polymorphisms may be used to identify individuals in 
families that are likely to carry mutations. If a polymorphism exhibits linkage disequalibrium with mutations 
15 in a KLK15 gene, it can also be used to screen for individuals in the general population likely to carry 
mutations. Polymorphisms which may be used include restriction fragment length polymorphisms (RFLPs), 
smgle-base polymorphisms, and simple sequence repeat polymorphisms (SSLPs). 

A probe of the invoition may be used to directly identify RFLPs. A probe or primer of the invention 
can additionally be used to isolate genomic clones such as YACs, B AGs, PACs, cosmids, phage or plasmids. 
2 0 The DNA in the clones can be screened for SSLPs using hybridization or sequencing procedures. 

Hybridization and amplification techniques described herein may be used to assay qualitative and 
quantitative aspects of klklS expression. For example, RNA may be isolated fi-om a cell type or tissue known 
to express kUcJS and tested utilizing the hybridization (e.g. standard Northern analyses) or PGR techniques 
referred to h^ein. The techniques may be used to detect differences in transcript size which may be due to 

2 5 normal or abnormal alternative splicing. Tbs techniques may be used to detect quantitative differences between 

levels of full length and/or alternatively splice transcripts detected in normal individuals relative to those 
individuals exhibiting symptoms of a disorder involving a KLK15 Related Protein. 

The primers and probes may be used in the above described methods in situ i.e directly on tissue 
sections (fixed and/or firozen) of patient tissue obtained from biopsies or resections. 
30 4.1.2 Methods for Detecting KLK15 Related Proteins 

Antibodies specifically reactive with a KLK15 Related Protein, or derivatives, such as enzyme 
conjugates or labeled derivatives, may be used to detect KLK15 Related Proteins in various samples (e.g. 
biological materials). They may be used as diagnostic or prognostic reagents and they may be used to detect 
abnormalities in the level of KLK15 Related Protein expression, or abnormalities in the structure, and/or 

3 5 tenqjoral, tissue, cellular, or subcellular location of a KLK15 Related Protem. Antibodies may also be used to 

screen potentially therapeutic compounds in vitro to determine their effects on disorders involving a KLK15 
Related Protein, and other conditions. In vitro immunoassays may also be used to assess or monitor the efBcacy 
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of particular therapies. The antibodies of the invention may also be used in vitro to detoinine the level of 
KLK15 expression in cells genetically engineered to produce a KLK15 Related Protein. 

The antibodies may be used in any known immunoassays which rely on the binding interaction 
between an antigenic determinant of a KLKI5 Related Protein and the antibodies. Examples of such assays are 
5 radioimmunoassays, enzyme immunoassays (e.g. ELISA), immunofluorescence, immunoprecipitation, latex 
agglutination, hemagglutination, and histochemical tests. The antibodies may be used to detect and quantify 
KLK15 Related Proteins in a sanq)le in order to determine its role in particular cellular events or pathological 
states, and to diagnose and treat such pathological states. 

In particular, the antibodies of the invention may be used in immuno-histochemical analyses, for 

1 0 exan]|)le, at the cellular and sub-subcellular level, to detect a KLK15 Related Protein, to localize it to particular 
cells and tissues, and to specific subcellular locations, and to quantitate the level of expression, 

Cytochemical techniques known in the art for localizing antigens using Hght and electron microscopy 
may be used to detect a KLK15 Related Protein. Generally, an antibody of the invention may be labeled with 
a detectable substance and a KLK15 Related Protein may be locahsed in tissues and cells based upon the 

1 5 presence of die detectable substance. Examples of detectable substances include, but are not limited to, the 
following: radioisotopes (e.g., ^ H, "C, ^^S, "^I), fluorescent labels (e.g., FTTC, rhodamine, lanthanide 
phosphors), luminescent labels sudi as luminol; enzymatic labels (e.g., horseradish peroxidase, beta- 
galactosidase, luciferase, alkaline phosphatase, acetylcholinesterase), biotinyl groups (which can be detected 
by marked avidin e.g., streptavidin containing a fluorescent maiker or enzymatic activity that can be detected 

20 by optical or calorimetric metiiods), predetermined polypeptide epitopes recognized by a secondary reporter 
(e.g., leucine zipper pair sequences, bindmg sites for secondary antibodies, metal binding domains, epitope 
tags). In some embodiments, labels are attached via spacer arms of various lengths to reduce potential steric 
hindrance. Antibodies may also be coupled to electron dense substances, such as ferritin or colloidal gold, 
which are readily visualised by electron microscopy. 

25 The antibody or sample may be inmiobilized on a carrier or solid support which is capable of 

immobilizing cells, antibodies etc. For example, the carrier or support may be nitrocellulose, or glass, 
polyaoylamides, gabbros, and magnetite. The support mat^al may have any possible configuration including 
spherical (e.g. bead), cylindrical (e.g. ioside surface of a test tube or well, or the ext^al surface of a rod), or 
flat (e.g. sheet, test strip). Indirect methods may also be employed in which die primary antigen-antibody 

3 0 reaction is amplified by the introduction of a second antibody, having specificity for the antibody reactive 
against KLK15 Related Protein. By way of example, if the antibody having specificity a KLK15 Related 
Protein is a rabbit IgG antibody, the second antibody may be goat anti-rabbit ganmia-globulin labeled with a 
detectable substance as described herein. 

Where a radioactive label is used as a detectable substance, a KX*K15 Related Protein may be 

3 5 localized by radioautography. The results of radioautography may be quantitated by deteimining the density 
of particles in the radioautographs by various optical methods, or by coimting the grains. 

In an embodiment, the invention contemplates a method for monitoring the progression of canc^ (e.g. 
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prostate cancer) in an individual, comprising: 

(a) contacting an amount of an antibody which binds to a KLK15 Related Protein, with a sample from 
the individual so as to form a binary complex comprising the antibody and KLK15 Related Protein 
in the sample; 

5 (b) detennining or detecting the presence or amount of complex formation in the sample; 

(c) repeating steps (a) and (b) at a point later in time; and 

(d) comparing the result of step (b) with the result of step (c), wherein a difference in the amount of 
complex formation is indicative of the progression of the cancer in said individual. 

The amount of complexes may also be compared to a value representative of the amount of the 
1 0 complexes from an individual not at risk of, or afflicted with, cancer (e,g. prostate cancer). 
4.2 Methods for Identifying or Evaluating Substances/Compoands 

The methods described herein are designed to identify substances that modulate the biological activity 
of a KLK15 Related Protein including substances that bind to KLK15 Related Proteins, or bind to other 
proteins that interact with a KLK15 Related Protein, to compounds that interfere with, or enhance the 
15 interaction of a KLK15 Related Protein and substances that bind to the KLK15 Related Protein or other 
proteins that interact widi aKLKlS Related Protein. Methods are also utilized that identify compounds diat 
bind to KLK15 regulatory sequences. 

The substances and compounds identified using the methods of the invention include but are not 
limited to peptides such as soluble peptides including Ig-tailed fusion peptides, members of random peptide 
2 0 libraries and combinatorial chemistry-derived molecular libraries made of D- and/or L-configuration amino 
acids, phosphopeptides (including members of random or partially degenerate, directed phosphopeptide 
libraries), antibodies [e.g. polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, single chain antibodies, 
fragments, (e.g. Fab, F(ab)2, and Fab expression library fragments, and epitope-binding fragments thereof)], 
and small organic or inorganic molecules. The substance or compound may be an endogenous physiological 

2 5 compound or it may be a natural or synthetic compound. 

Substances which modulate a KLK15 Related Protein can be identified based on their ability to bind 
to a KLK15 Related Protein. Therefore, the mvendon also provides methods for identifying substances which 
bind to a KLK15 Related Protein, Substances identified using the methods of the invention may be isolated, 
cloned and sequenced using conventional techniques. A substance that associates with a polypeptide of the 

3 0 invention may be an agonist or antagonist of the biological or immunological activity of a polypeptide of the 

invention. 

The term "agonist", refors to a molecule that increases the amount of, or prolongs the duration of, the 
activity of the protein. The term "antagonist" refers to a molecule which decreases the biological or 
immunological activity of the protein. Agonists and antagonists may include proteins, nucleic adds, 
3 5 carbohydrates, or any other molecules that associate with a protein of the invention. 

Substances which can bind with a KLK15 Related Protein may be identified by reacting a KLK15 
Related Protein with a test substance which potentially binds to a KLK15 Related Protein, under conditions 
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which penmt the fonnatioD of substance-KLK15 Related Protein complexes and removing and/or detecting 
the complexes. The complexes can be detected by assaymg for substance-KLK15 Related Protein complexes, 
for free substance, or for non-complexed KLK15 Related Protein, Conditions which permit the formation of 
substance-KLK15 Related Protein complexes may be selected having regard to factors such as the nature and 
5 amounts of the substance and the protein. 

The substance-protein complex, free substance or non-complexed proteins may be isolated by 
conventional isolation techniques, for example, salting out, chromatography, electrophoresis, gel filtration, 
fractionation, absorption, polyacrylamide gel electrophoresis, agglutination, or combinations thereof. To 
fadUtate the assay of the components, antibody against KLK15 Related Protein or the substance, or labeled 

1 0 KLK15 Related Protein, or a labeled substance may be utilized. The antibodies, proteins, or substances may 
be labeled with a detectable substance as described above. 

A KLK15 Related Protein, or the substance used in the method of the invention may be insolubilized. 
For example, a KLK15 Related Protein, or substance may be bound to a suitable carrier such as agarose, 
cellulose, dextran, Sephadex, Sepharose, carboxymethyl cellulose polystyrene, filter paper, ion-exchange resin, 

15 plastic film, plastic tube, glass beads, polyamine-methyl vinyl-ether-maleic acid copolymer, amino acid 
copolymer, ethylene-maleic acid copolymer, nylon, silk, etc. The carrier may be in die shape of, for example, 
a tube, test plate, beads, disc, sphere etc. The insolubilized protein or substance may be prepared by reactmg 
the material with a suitable insoluble cairier using known diemical or physical methods, for example, cyanogen 
bromide coupling. 

2 0 The invention also contemplates a method for evaluating a compound for its ability to modulate the 

biological activity of a KLK15 Related Protein of the invention, by assaying for an agonist or antagonist (i.e. 
enhancer or inhibitor) of the bmding of a KLK15 Related Protein with a substance which binds with a KLK15 
Related Protein. The basic method for evaluating if a compound is an agonist or antagonist of the binding of 
a KLK15 Related Protein and a substance that binds to the protein, is to prepare a reaction mixture containing 
25 the KLK15 Related Protein and the substance under conditions which permit the formation of substance- 
KLKIS Related Protein complexes, m the presence of a test compound. The test compound may be initially 
added to the mixture, or nuy be added subsequent to the addition of the KLK15 Related Protein and substance. 
Control reaction mixtures without the test compound or with a placebo are also prepared. The formation of 
complexes is detected and the formation of complexes in the control reaction but not in the reaction mutture 

3 0 indicates that the test compound interferes with the interaction of the KLK15 Related Protein and substance. 

The reactions may be carried out in the liquid phase or the KLK15 Related Protein, substance, or test 
compound may be immobilized as described herein. The abihty of a compound to modulate the biological 
activity of a KLK15 Related Protein of the invention may be tested by determining the biological effects on 
cells. 

35 It will be understood that the agonists and antagonists i.e. inhibitors and enhancers that can be assayed 

using the methods of the invention may act on one or more of the binding sites on the protein or substance 
including agonist binding sites, competitive antagonist bmding sites, non-competitive antagonist binding sites 
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or allosteric sites. 

The invention also makes it possible to screen for antagonists that inhibit the effects of an agonist of 
the interaction of KLK15 Related Protein with a substance which is capable of binding to the KLK15 Related 
Protein. Thus, the invention may be used to assay for a compound that competes for the same binding site of 
5 a KLK15 Related Protein. 

The invention also contemplates methods for identifying compounds that bind to proteins that interact 
with a KLK15 Related Protein. Protein-protein intaiictions may be identified using conventional methods such 
as co-immunoprecipitation, crosslinking and co-purification through gradients or chromatographic columns. 
Methods may also be employed that result in die simultaneous identification of genes which encode proteins 
1 0 interacting with a KLK15 Related Protein. These methods include probing expression libraries with labeled 
KLK15 Related Protein. 

Two-hybrid systems may also be used to detect protein interactions m vivo. Generally, plasmids are 
constructed that encode two hybrid proteins. A first hybrid protein consists of die DNA-binding domain of a 
transcription activator protein fiised to a KLK15 Related Protein, and the second hybrid protein consists of 
15 the transcription activator protein^ activator domain fused to an unknown protein encoded by a cDNA which 
has been recombined into the plasmid as part of a cDNA library. The plasmids are transformed into a strain 
of yeast (e.g. S. cerevisiae) tiiat contains a reporter gene (e.g. lacZ, lucifoase, alkaline phosphatase, hoiseradish 
peroxidase) whose regulatory region contains the transcription activator^ binding site. The hybrid proteins 
alone cannot activate the transcription of the reporter gene. However, interaction of the two hybrid proteins 
2 0 reconstitutes the functional activator protein and results in expression of the reports: gene, which is detected 
by an assay for the reporter gene product. 

It will be appreciated that fusion proteins may be used in the above-described methods. In particular, 
KLK15 Related Proteins fused to a glutathione-S-transferase may be used in the methods. 

A modulator of a KLK15 Related Protein of the invention may also be identified based on its ability 
25 to inhibit or enhance catalytic activity of the protein. 

The reagents suitable for applying the methods of the mvention to evaluate compounds that modulate 
a KLK15 Related Protein may be packaged into convenient kits providing the necessary materials packaged 
into suitable containers. The kits may also include suitable supports useful in p^ormmg the methods of the 
invention. 

30 43 Compositions and Treatments 

The proteins of the invention, substances or compounds identified by the methods described herein, 
antibodies, and nucleic acid molecules of the invention may be used for modulating the biological activity of 
a KLK15 Related Protein, and they may be used in the treatment of conditions such as cancer (particularly 
thyroid, prostate, colon, kidney, testicular cancer) and thyroid disorders in a patient. 

35 Accordingly, the substances, antibodies, peptides, and compounds may be formulated into 

pharmaceutical compositions for administration to subjects in a biologically compatible form suitable for 
administration in vivo. By "biologically compatible form suitable for administration in vivo" is meant a form 
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of the active substance to be a±ninistered in which any toxic effects are outweighed by the therapeutic effects. 
The active substances may be administered to livmg organisms including humans and animals. Administration 
of a therapeutically active amount of a pharmaceutical composition of the present invention is defined as an 
amount effective, at dosages and for periods of time necessary to achieve the desired result. For example, a 
5 therapeutically active amount of a substance may vary according to factors such as the disease state, age, sex, 
and weight of the individual, and the ability of antibody to elicit a desired response in the individual. Dosage 
regima may be adjusted to provide the optimum therapeutic response. For example, sev^ divided doses may 
be administered daily or the dose may be proportionally reduced as mdicated by the exigencies of the 
therapeutic situation. 

1 0 The active substance may be administered in a convenient manner such as by injection (subcutaneous, 

intravenous, etc.), oral administration, inhalation, transdermal application, or rectal administration. Depending 
on the route of administration, the active substance may be coated in a material to protect the substance from 
the action of enzymes, acids and other natural conditions that may inactivate the substance. 

The compositions described herein can be prepared by per se known methods for the preparation of 

15 pharmaceutically acceptable compositions which can be administered to subjects, such that an effective 
quantity of the active substance is combined in a mixture with a pharmaceutically acceptable vehicle. Suitable 
vehicles are described, for example, in Remington^ Pharmaceutical Sciences (Remington^ Pharmaceutical 
Sciences, Mack Publishing Company, Easton, Pa., USA 1985). On this basis, die compositions include, albeit 
not exclusively, solutions of the active substances in association with one or more pharmaceutically acceptable 

2 0 vehicles or diluents, and contained in buffered solutions witii a suitable pH and iso-osmotic with die 
physiological fluids. 

The compositions are indicated as therapeutic agents either alone or in conjunction with other 
therapeutic agents or other forms of treatment (e.g. chemotherapy or radiotherapy). For example, the 
compositions may be used in combination with anti-proliferative agents, antimicrobial agents, 

2 5 immunostimulatory agents, or anti-inflammatories. In particular, the compounds may be used in combination 

witii anti-viral and/or anti-proliferative agents. The compositions of the invention may be administered 
concuirentiy, separately, or sequentially wifli other tiierapeutic agents or th^apies. 

Vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial 
plasmids, may be used to deliver nucleic acid molecules to a targeted organ, tissue, or ceU population. Methods 

3 0 well known to those skilled in the art may be used to construct recombinant vectors which will express 

antisense nucleic acid molecules of the invention. (See, for example, the techniques described in Sambrook et 
al (supra) and Ausubel et al (supra)). 

The nucleic acid molecules comprising full lengtii cDNA sequences and/or their regulatory elements 
enable a skilled artisan to use sequences encoding a protein of the invention as an investigative tool in sense 
3 5 (Youssoufian H and H F Lodish 1993 Mol Cell Biol 13:98-104) or antisense (Eguchi et al (1991) Annu Rev 
Biochem 60:631-652) regulation of gene function. Such technology is well known in the art, and sense or 
antisense oligomers, or larger fragments, can be designed from various locations along tiie coding or control 
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regions. 

Genes encoding a protein of the invention can be turned off by transfecting a cell or tissue with 
vectors which express high levels of a desired KLK15-encoding fragment. Such constructs can inundate cells 
with untranslatable sense or antisense sequences. Even in the absence of integration into the DNA, such vectors 
5 may continue to transcribe RNA molecules until all copies are disabled by endogenous nucleases. 

Modifications of gene expression can be obtained by designing antisense molecules, DNA, RNA or 
PNA, to die regulatory regions of a gene encoding a protein of the invention, i.e., the promote, enhancers, 
and introns. Preferably, oligonucleotides are derived from the transcription initiation site, eg, between -10 and 
+10 regions of the leader sequence. The antis^ose molecules may also be designed so that fhey block translation 

10 of mRNA by preventing the transcript from bmding to ribosomes. Inhibition may also be achieved using "triple 
helix" base-pairing methodology. Triple helix pairing compromises the ability of the double helix to open 
sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. Therapeutic advances 
using triplex DNA were reviewed by Gee J E et al (In: Huber B E and B I Carr (1994) Molecular and 
Inamunologic Approaches, Futura Publishing Co, Mt Kisco N.Y.). 

1 5 Ribozymes are enzymatic RNA molecules that catalyze the specific cleavage of RNA. Ribozymes act 

by sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. The invention therefore contemplates engineered hamm^ead motif ribozyme 
molecules that can specifically and efficiently catalyze endonucleolytic cleavage of sequences encoding a 
protein of the invention. 

2 0 Specific ribozyme cleavage sites within any potential RNA target may initially be identified by 

scanning the target molecule for ribozyme cleavage sites which include the following sequences, GUA, GUU 
and GUC. Once the sites are identified, short RNA sequences of between 15 and 20 ribonucleotides 
corresponding to the region of the target gene containing the cleavage site may be evaluated for secondary 
structural features which may render the oligonucleotide inoperable. The suitability of candidate targets may 

2 5 also be detemoined by testing accessibility to hybridization with con^lem^tary oligonucleotides using 

ribonuclease protection assays. 

Methods for introducing vectors into cells or tissues include those methods discussed herein and 
which are suitable for in vivo, in vitro and ex vivo therapy. For ex vivo therapy, vectors may be introduced into 
stem cells obtained from a patient and clonally propagated for autologous transplant into the same patient (See 

3 0 U.S. Pat Nos. 5,399,493 and 5,437,994). Delivery by transfection and by liposome are well known in the art 

An antibody against a KLK15 Related Protein may be conjugated to chemotherapeutic drugs, toxins, 
unmunological response modifiers, hematogenous agents, enzymes, and radioisotopes and used in the 
prevention and treatment of cancer (e.g. thyroid, prostate, colon, kidney, testicular canca:). For exaD:q)le, an 
antibody against a KLK15 Related Protein may be conjugated to toxic moieties including but not linuted to 
3 5 ricin A, diphth^a toxin, abrin, modeccin, or bacterial toxins from Pseudomonas or Shigella. Toxins and'their 
derivatives have been reported to form conjugates with antibodies specific to particular target tissues, such as 
cancer or tumor cells in order to obtain specifically targeted cellular toxicity (Moolten F.L. et al, ImmuiL Rev. 
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62:47-72. 1982, and Bemhwd., M.L Cancer Res. 43:4420, 1983). 

Conjugates can be prepared by standard means known in the art. A number of bifunctional linking 

agents (e.g. heterobifunctional linkers such as N-succininiidyl-3-(2-pyridyldithio)propionate) are available 

commercially from Pierce Chemically Company, Rockford, 111. 
5 Administration of the antibodies or immunotoxins for therapeutic use may be by an intravenous route, 

although with proper formulation additional routes of administration such as intraperitoneal, oral, or 

transdermal administration may also be used. 

A KLK15 Related Protein may be conjugated to chemotherapeutic drugs, toxins, immunological 

response modifiers, enzymes, and radioisotopes using methods known in the art 
1 0 The invention also provides immunotherapeutic approaches for preventing or reducing the severity 

of a cancer. The clinical signs or symptoms of the cancer ia a subject are indicative of a beneficial effect to the 

patient due to the stimulation of the subject's immune response against the cancer. Stimulating an immune 

response refers to inducing an immune response or enhancing the activity of immunoeffector cells in response 

to administration of a vaccine preparation of the invention. The prevention of a cancer can be indicated by an 
1 5 increased time before the appearance of cancer in a patient that is predisposed to developing cancer due for 

example to a genetic disposition or e:q)0sure to a carcinogenic agent The reduction in die severity of a cancer 

can be indicated by a decrease in size or growtii rate of a tumor. 

Vaccines can be derived from a KLK Related Protein, peptides derived therefrom, or chemically 

produced synthetic peptides, or any combination of these molecules, or fusion proteins or peptides thereof. The 
2 0 proteins, peptides, etc. can be synthesized or prepared recombinantly or otherwise biologically, to comprise 

one or more amino acid sequences corresponding to one or more epitopes of a tumor associated protein. 

Epitopes of a tumor associated protein will be understood to include the possibility that in some instances 

amino acid sequence variations of a naturally occurring protein or polypeptide may be antigenic and confer 

protective immunity against cancer or anti-tumorigenic effects. Sequence variations may include without 

2 5 limitation, amino add substitutions, ext^ons, ddetions, truncations, interpolations, and combinations (hereof. 

Such variations fall within tiie scope of the invention provided the protein containing them is immunogenic and 
antibodies against such polypeptide cross-react with naturally occurring KLK15 Related Protein to a suftident 
extent to provide protective immunity and/or anti-tunoorigenic activity when admmistered as a vaccine. 

The proteins, peptides etc, can be incorporated into vaccines capable of inducing an inomune response 

3 0 using methods known in the art. Techniques for enhancing the antigenicity of the proteins, peptides, etc. are 

known in the art and include incorporation into a multimeric structure, binding to a highly immunogenic protein 
carder, for example, keyhole limpet hemocyanin (KLH), or diptheria toxoid, and admmistration in combination 
with adjuvants or any other enhancer of immune response. 

Vacdnes may be combined with physiologically acceptable media, including immunologicaUy 
3 5 acceptable diluents and carriers as well as commonly ^x^loyed adjuvants such as Freund's Complete Adjuvant, 
saponin, alum, and the like. 

It will be further appreciated that anti-idiotype antibodies to antibodies to KLK15 Related Proteins 
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described herein are also useful as vaccines and can be similarly formulated. 

The administration of a vaccine in accordance with the invention, is generally applicable to the 
prevention or treatment of cancers including thyroid, prostate, colon, kidney, and testicular cancer. 

The administration to a patient of a vaccine in accordance with the invention for the prevention and/or 
5 treatment of cancer can take place before or after a surgical procedure to remove the cancer, before or after a 
chemotherapeutic procedure for the treatment of cancer, and before or after radiation therapy for the treatment 
of cancer and any combination thereof. The cancer immunotherapy in accordance with the invention would be 
a preferred treatment for the prevention and /or treatment of cancer, since the side effects involved are 
substantially minimal compared with the other available treatments e.g. sui^ery, chemotherapy, radiation 
1 0 therapy. The vaccines have the potential or capability to prevent cancer in subjects without cancer but who are 
at risk of developing cancer. 

The activity of the proteins, substances, compounds, antibodies, nucleic acid molecules, agents, and 
compositions of the invention may be confirmed in animal experimental model systems. Therapeutic efficacy 
and toxicity may be determined by standard pharmaceutical procedures in cell cultures or with experimental 
1 5 animals, such as by calculating the ED50 ( the dose therapeutically effective in 50% of the population) or LDso 
(the dose lethal to 50% of the population) statistics. The therapeutic index is the dose ratio of therapeutic to 
toxic ^ects and it can be expressed as the ED50/LD5D ratio. Pharmaceutical con^ositions which exhibit large 
therapeutic mdices are preferred. 
4.4 Other Applications 

2 0 The nucleic acid molecules disclosed herein may also be used in molecular biology techniques that 

have not yet been developed, provided the new techniques rely on properties of nucleotide sequences that are 
currently known, including but not limited to such properties as the triplet genetic code and specific base pair 
interactions. 

The invention also provides methods for studying the function of a polypeptide of the invention. 

2 5 Cells, tissues, and non-human animals lackmg in expression or partially lacldng in expression of a nucleic acid 

molecule or gene of the invention may be developed using recombinant expression vectors of the invention 
having specific deletion or insertion mutations in the gene. A recombinant expression vector may be used to 
mactivate or alter the endogenous gene by homologous recombination, and thereby create a deficient cell, 
tissue, or animal. 

3 0 Null alleles may be generated in ceUs, such as embryonic stem cells by deletion mutation. A 

recombinant gene may also be engineered to contain an insertion mutation that inactivates the gene. Such a 
construct may then be introduced into a cell, such as an embryonic stem cell, by a technique such as 
transfection, electroporation, injection etc. Cells lacking an intact gene may then be identified, for example 
by Southern blotting. Northern Blotting, or by assaying for expression of the encoded polypeptide using the 
3 5 methods described herem. Such cells may then be fused to emibryonic stem cells to generate transgenic 
non-human animals deficient in a polypeptide of the invention. Germline transmission of the mutation may 
be achieved, for example, by aggregating the embryonic stem cells with early stage embryos, such as 8 cell 
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embryos, in vitro; transferring the resulting blastocysts into recipient females and; generating gennline 
transmission of the resulting aggregation chimeras. Such a mutant animal may be used to define specific cell 
populations, developmental patterns and in vivo processes, normally dependent on gene expression. 

The invention thus provides a transgenic non-human mammal all of whose germ cells and somatic 
5 cells contain a recombinant expression vector that inactivates or alters a gene encoding a KLKIS Related 
Protein. In an embodunent the invention provides a transgenic non-human manunal all of whose germ cells 
and somatic cells contain a recombmant expression vector that inactivates or alt^s a gene encoding a KLK15 
Related Protein resulting in a KLK15 Related Protein associated pathology. Further, the invention provides 
a transgenic non-human mammal which does not express or has altered (e.g. reduced) expression of a KLK15 
1 0 Related Protein of the invention. In an embodiment, the invention provides a transgenic non-human mammal 
which does not express or has reduced expression of a KLK15 Related Protein of the invention resulting in a 
KLK15 Related Protein associated pathology. A KLK15 Related Protein pathology refers to a phenotype 
observed for a KLK15 Related Protein homozygous or heterozygous mutant. 

A transgenic non-human animal mcludes but is not limited to mouse, rat, rabbit, sheep, hamster, dog, 
1 5 cat, goat, and monkey, preferably mouse. 

The invention also provides a transgenic non-human animal assay system which provides a model 
system for testing for an agent that reduces or inhibits a pathology associated with a KLK15 Related Protein, 
preferably a KLKIS Related Protein associated pathology, con^)rising: 

(a) administering the agent to a transgenic non-human animal of the invention; and 
2 0 (b) determining whether said agent reduces or inhibits the pathology (e.g. KLK15 Related Protein 

associated pathology) in the transgenic non-human animal relative to a transgenic non-human 
animal of step (a) which has not been administered the agent. 
The agent may be useful in the treatment and prophylaxis of conditions such as cancer as discussed 
herein. The agents may also be incorporated in a pharmaceutical composition as described herein. 

2 5 The following non-limiting example is illustrative of the present invention: 

Example 

Materials and Methods 
Identification of the new gene 

A contiguous map for the human kallikrein gene locus extending j&x)m the KLKl gene (centromere) 
30 to the KLK14 gene (telomere) (7,8,1 1,12,27) was constructed. Overlapping bacterial artificial chromosome 
(BAG) clones spanning this area were identified by screening of a human BAG library using different 
radiolabeled gene-specific probes. An area of ~ 300 kb of genomic sequence was established using different 
techniques, as previously described (1 1,27). By performing an EcoRl restriction analysis, the kallikrein locus 
was oriented along the EcoRl restriction map of chromosome I9ql3 available from the Lawrence Livermore 

3 5 National Laboratory (LLNL). A BAG clone that extends more centromencally (BG 781134) was then 

identified. Gontigs of linear genomic sequences from this clone are available from the LLNL. Initially, these 
contig sequences were used to predict the presence of novel genes, using bioinformatic approaches, as 



wo 02/14485 



PCT/CAOl/01141 



-27- 

previously described (8,12), and a putative new serine protease was identified. The sequence of the putative 
gene was then verified by different approaches including sequencing, EST database search, PGR screening of 
tissues, as described below. 
Expressed sequence tag (EST) searching 
5 The predicted exons of the putative new gene were subjected to homology search using the BLASTN 

algorithm (28) on the National Center for Biotechnology Information web server (http://wv^ 
ncbi.nhn.nih.gov/BLAST/) against the human EST database (dbEST). Clones with > 95% homology were 
obtained from the IM.A.GJE. consortium (29) through Research Genetics Inc, Huntsville, AL. The clones wore 
propagated, purified as described elsewh^e (30) and sequenced from both directions with an automated 

1 0 sequencer, using insert-flanking vector primers. 

Prostate cancer cell line and hormonal stimulation experiments 

The LNCaP prostate cancer cell line was purchased from the American Type Culture Collection 
(ATCC), Rockville, MD. Cells were cultured in RPMI media (Gibco BRL, Gaithersburg, MD) supplemented 
with glutamine (200 nunol/L), bovine insulm (10 mg/L), fetal bovine serum (10%), antibiotics and 

1 5 antimycotics, in plastic flasks, to near confluency. The cells were then aliquoted into 24-well tissue culture 
plates and cultured to 50% confluency. 24 hours before the e3q)erimrats, the culture media were changed into 
phenol red-free media containing 10% charcoal-stripped fetal bovine serum. For stnnulation experiments, 
various steroid hormones dissolved in 100% ethanol w^e added into the culture media at a final concentration 
of 10"^ M. Cells stunulated with 100% ethanol were mcluded as controls. The cells were cultured for 24 hours, 

2 0 then harvested for mRNA extraction. 

Reverse transcriptase polymerase chain reaction (RT-PCR) for the KLK15 gene 

Total RNA was extracted from the LNCaP cell line or from prostate tissues using Trizol reagent 
(Gibco BRL) following the manufacturer's instructions. RNA concentration was determined 
spectrophotometrically. 2 ^g of total RNA was reverse-transcribed into first strand cDNA using the 

25 Superscript^ preanq)lification system (Gibcq BRL). The final volume was 20 |xl. Based on the combined 
information obtained from the predicted genomic structure of the new gene and the EST sequences (see below), 
two gene-specific primers vrere designed (KLK15-F1 - SEQ ID NO. 47 and KLK15-R1 - SEQ ID NO. 48 ) 
(Table 1) and PGR was carried out in a reaction mixture containmg 1 \il of cDNA, 10 mM Tris-HCl (pH 8.3), 
50 mM KCl, 1.5 mM MgCla, 200 ^M dNTPs (deoxynucleoside triphosphates), 150 ng of primers and 2.5 units 

30 of HotStar™ DNA polymerase (Qiagen Inc., Valencia, CA) on a Perkin-Ehner 9600 thermal cycler. The 
cycling conditions were 95'*C for 15 minutes to activate the Taq DNA polymerase, followed by 35 cycles of 
94°C for 30 s, 64X for 30 s, 72°C for 1 min and a final extension step at 72°C for 10 min. Equal amounts of 
PGR products were electrophoresed on 2% agarose gels and visualized by ethidiimi bromide staining. All 
primers for RT-PCR spanned at least 2 exons to avoid contamination by genomic DNA. To verify the identity 

35 of the PGR products, they were cloned into the pCR 2.1-TOPO vector (Invitrogen, Carlsbad, CA, USA) 
according to the manufacturer's instructions. The inserts were sequenced from both directions using vector- 
specific primers, with an automated DNA sequencer. 
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Tissue expression 

Total RNA isolated from 26 different human tissues was purchased from Clontech, Palo Alto, CA. 
cDNA was prepared as described above for the tissue culture experiments and used for PGR reactions. Tissue 
cDNAs were amplified at various dilutions using two gene-specific primers (KLK15-F2 - SEQ ID NO. 49 and 
5 KLK15-R1 - SEQ ID NO. 48) (Table 1). Due to the high degree of homology between kallikreins. and to 
exclude non-specific amplificaticn, PGR products were cloned and sequenced. 
Prostate cancer tissues 

Prostate tissue samples were obtained from 29 patients who had undergone radical retropubic 
prostatectomy for prostatic adenocarcinoma at the Chaiite Univ^sity Hospital, Berlin, Germany. The patients 

1 0 did not receive any hormonal therapy before surgery. The use of these tissues for research purposes was 
approved by the Ethics Committee of the Charite Hospital. Fresh prostate tissue samples were obtained fi-om 
the cancerous and non-cancerous parts of the same prostates that had been removed. Small pieces of tissue were 
dissected immediately after removal of the prostate and stored in liquid nitrogen until analysis. Histological 
analysis from all the tissue pieces was performed as previously described (31), to ensure that the tissue was 

1 5 either malignant or benign. The tissues were pulverized with a hammer under liquid nitrogen and RNA was 
extracted as described above, using Trizol reagent 
Statistical analysis 

Statistical analysis was performed with SAS software (SAS Institute, Gary, NC). The analysis of 
differences between KLK15 expression in non-canceious versus cancerous tissues from the same patient was 
2 0 performed with the non-parametric McNemar test. The binomial distribution was used to compute the 
significance level. Prostate tumor KLK15 mRNA levels were qualitatively classified into two categories 
(KLK15 - low and KLK15 - high groups) and associations between KLK15 status and other variables were 
analyzed using the Fisher's exact test 
Structure analysis 

2 5 Multiple alignment was perfbrmed using the "Glustal X" software package and the multiple alignment 

program available from the Baylor Gollege of Medicine, Houston, TX, USA. Phylogenetic studies were 
performed using the 'Thylip" software package. Distance matrix analysis was performed using the **Neighbor- 
Joining^UPGMA" program and parsimony analysis was done using the *Trotpars" program. Hydrophobicity 
study was performed using the Baylor College of Medicine search launcher. Signal peptide was predicted using 

30 the "SignalP" server. Protein structure analysis was performed by "SAPS" (structural analysis of protein 
sequence) program. 
Results 

Cloning of the KLK15 gene 

A contiguous map for the human kallikrein gene locus extending from the KLKl gene (centromere) 
35 to the KLK14 gene (telomere) was previously established (7,8,11,12,27). In order to investigate the presence 
of other kallikrein-like genes centromeric to KLKl, a BAG clone (BG 781 134) was obtained as described in 
materials and methods. According to the published genomic sequence of prostate specific antigen (PSA), and 



wo 02/14485 



PCT/CAOl/01141 



-29- 

human renal kallikrein (KLKl) genes, gene-specific primers were designed for each of these genes (Table 1, 
SEQ ID NOs. 47-50) and polymerase chain reaction (PCR)-based amplification protocols were developed 
which allowed generation of specific PGR products with genomic DNA as a template. PGR screening of the 
BAG clone by these gene-specific primers indicated that this clone is positive for KLKl but negative to PSA, 
5 thus, confinning its location to be centromeric to PSA. 

A putative new serine protease was predicted from the sequence of this clone by computer programs 
as previously described (12). This clone was digested, blotted on a membrane and hybridized with gene- 
specific primers for the putative KLK15 gene (according to the predicted sequence), and positive fragments 
were subcloned and sequenced to verify the structure of the putative gene. This putative gene sequence was 

1 0 then blasted against the human EST database and two EST clones were identified (GenBank accession 
#AW274270 and # AW205420). These two clones were 99% identical to the last exon and the 3 'untranslated 
region of the gene and the second EST ends with a stretch of 17 adenine (A) nucleotides that were not found 
in the genomic sequence, thus verifying the 3' end of the gene and the position of the poly A tail. 

To identify the full inRNA structure of the gene and to determine the exon/intron boundaries, PGR 

15 reactions were perfonned using primers located in different computer-predicted exons, using a panel of 26 
human tissue cDNAs as templates. PGR products woe sequenced. Two of these primers (KLKIS-Fl - SEQ 
ID NO. 47, and KLK15-R1 - SEQ ID NO. 48)) (Table 1) were able to amplify the fuU codmg region of the 
gene from different tissues. Gon^aring the mRNA with the genomic structure indicated the presence of a gene 
formed of five coding exons with 4 intervening introns. Translation of the mRNA sequence in all possible 

2 0 reading frames revealed the presence of only one frame that gives an uninterrupted polypeptide chain, that also 
contains the highly conserved structural motifs of the kallikreins, as discussed below. 
Structural characterization of the KLK15 gene 

As shown in Figure 1, the KLK15 gene is formed of 5 coding exons and 4 intervening introns, 
although, as with other kallikrein genes, the presence of further upstream untranslated exon(s) could not be 

2 5 ruled out (17,32,33). All of the exon Antron splice sites conform to the consensus sequence for eukaryotic 

splice sites (34). The gene frnther follows strictly the common structural features of other members of the 
human kallikrein multigene family, as described below. The predicted protem-coding region of the gene is 
formed of 771 bp, encoding a deduced 256 amino acid polypeptide with a predicted molecular weight of 28.1 
kDa. The potential translation initiation codon matches the consensus Kozak sequence (35), moreover, there 

3 0 is a purine at position (-3) which occurs in 97% of vertebrate mRNAs (36). It should also be noted, that like 

most other kallikrein-like genes, KLK15 does not have the consensus G nucleotide at position (h4). 

Nucleotides 7764-7769 (ATTAAA) (SEQ ID NO. 40) closely resemble a consensus polyadenylation 
signal (37) and are followed, after 17 nucleotides, by tiie poly A tail. No otiier potential polyadenyladon signals 
were disc^nable in the 3' untranslated region, suggesting that the above sequence is the actual polyadenylation 
3 5 signal. Although AATAAA (SEQ ID NO. 40) is highly conserved, natural variants do occur, and the ATTAAA 
(SEQ ID NO. 40) sequence is reported to occur as a natural polyadenylation variant m 12% of vertebrate 
mRNA sequences (38). The presence of glutamic acid (E) at position 203 suggests that KLK15 will likely 
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possess a unique substrate specificity. PSA has a serine (S) residue in the corresponding position and has 
chymotryptic-like activity. Many other kallikreins usually have aspartate (D) in this position indicating a 
trypsin-like activity (Figure 2) (6). 

Although the KLK15 protein sequence is unique, comparative analysis revealed that it has a 
5 considerable degree of homology with other members of the kallikrein multigene family. KLK15 shows 5 1 % 
protein identity and 66% sunilarity with the trypsin like serine protease (TLSP) and 49%, 48% identity with 
the neuropsin and KLK-L3 proteins, respectively. Hydrophobicity analysis revealed that the amino-terminal 
region is quite hydrophobic (Figure 3), consistent with the possibility that this region may harbor a signal 
sequence, analogous to other serine proteases. Computer analysis of the KLK15 protein sequence predicted 

10 a cleavage site between amino acids 16 and 17 (TAA-QD). Sequence alignment (Figure 2) also revealed 
another potential cleavage site (Lys^^), at a site homologous to the activation site of other serine proteases 
[lysine (K) or arginine (R) is present in most cases] (39). Several evenly distributed hydrophobic regions 
throughout the KLK15 polypeptide are consistent with a globular protein, similar to other kallikreins and serine 
proteases. Thus, as is the case with other kallikretos, KLK15 is presumably translated as an inactive 256 amino 

1 5 acid preproenzyme precursor. Prepro-KLK15 has 21 additional residues which constitute the pre-region (the 
signal peptide formed of 16 residues), and the propeptide (5 residues). 

The dotted region in Figure 2 indicates an 11-amino acid loop characteristic of the classical kallikreins 
(PSA, KLKL, and KLK2) but not found in KLK15 or other members of the kallikrein multi-gene family 
(10,11,13.14). However, KLK15 has a unique 8 amino acid loop (HNEPGTAG) (SEQ ED NO. 10) at positions 

2 0 148-155, not found in any other kallikrein (Figure 2). Twenty nine "invariant" amino acids surrounding the 
active site of serine proteases have been described (40), Of these, twenty eight are conserved in BCLK15. One 
of tiie unconserved amino acids (Ser^^ instead of Pro) is also found in prostase, KLK-L2 and KLK-L5 proteins, 
and represents a conserved evolutionary change to a protein of the same group, according to protein evolution 
studies (41). Twelve cysteine residues are present in the putative mature KLK15 protein; ten of them are 

2 5 conserved in all kallikreins, and would be expected to form disulphide bridges. The oth^ two (C13 1 and C243) 

are not found in PSA, KLKl, KLK2 or KLK-L4, however, tiiey are found m similar positions in all otiier 
kallikrein genes and are expected to form an additional disulphide bond. 

To predict the phylogenetic relatedness of the KLK15 protein with other serine proteases, the amino 
acid sequences were aligned together using the Unweighted Pair Group Method widi Arithmetic mean 

3 0 (UPGMA) and the Neighbor- Joining distance matrix methods, and the **Protpars" parsimony method. All 

phylogenetic trees obtained agreed that other serine proteases (non-kallikreins) can be grouped together as a 
separate group, indicating that kallikreins rqjresent a separate step in the evolution of serine proteases. KLK15 
was grouped with the KLK-L3 and TLSP (Figure 4) and the classical kallikreins (hKl, hK2, and PSA) are 
grouped togptiier in all trees, suggesting that the separation between classical kallikrems and the kallikrein-like 
3 5 genes occurred early during evolution, consistent witii suggestions of previous studies (13). 
Splice variants of the KLK15 gene 

PGR screening for KLK15 transcripts using gene-specific primers (KLK15-F2 -SEQ ID NO. 49 and 
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KLK15-R2 - SEQ JD NO. 50) (Table 1) revealed the presence of 3 bands in most of the tissue cDNAs 
examined (Figure 6). These bands were gel purified, cloned and sequenced. The upper band represents the 
classical form of the gene, and the lower band is splice variant 3 (Figure 7). The middle band represents two 
other splice variants. Restriction digestion of the PGR product of the middle band with Stu I, followed by gel 
5 separation, purification, and sequencing revealed that it is composed of splice variants 1 and 2 which have 
approximately the same length (splice variant 1 has exon 4 (137 bp) but is missing 118 bp from exon 3, while 
splice variant 2 has an additional 118 bp of exon 3 but missing exon 4. AU splice variants are expected to 
encode for truncated protein products (Figure 5). 
Chromosoinal localization of tKie KLK15 gene 

1 0 Restriction analysis study of a number of overlapping BAG clones spanning the human kallikrein locus 

followed by comparison with the EcoRl restriction map of the area (available from the LLNL web site) enabled 
identification of a BAG clone (EC 25479) that is telomerically adjacent to BC 781134 (which harbors the 
BCLK15 gene). Blasting the sequences of the two clones showed that the ends of these clones are overlapping. 
By identifying the position of the KLKl, KLK3 and KLK15 genes along these clones, the relative location and 

15 the direction of transcription of these three genes were precisely defined. KLKl is the most centromeric and 
its direction of transcription is from telomere to centromere, followed by KLK15, which is more telomeric and 
transcribes in the same direction. The distance between the two genes is 1501 bp in length. The KLK3 gene 
is more telomeric, located at a distance of 23,335 from the KLK15, and is transcribed in the opposite direction 
(Figure 6). These results are consistent with previous reports where the distance between KLK3 and KLKl was 

2 0 roughly estimated to be ~ 3 1 Kb (6,27). 

Tissue expression and hormonal regulation of the KLIQ5 gene 

As shown in Figure 7, the KLKl 5 gene is expressed at highest levels in the thyroid gland. Lower 
levels of expression are also seen m the prostate, salivary and adrenal glands, colon, testis and kidney. In order 
to verify the RT-FCR specificity, representative PGR products were cloned and sequenced. Figure 8 shows tiiat 
25 the KLK15 gene is up-regulated by steroid hormones in the human LNGaP prostate cancer cell line. 
KLK15 expression in prostate cancer 

The expression of the KLK15 gene in normal and cancerous prostatic tissues was examined by RT- 
PGR. Actin was included as a control gene to ensure the quality and amount of the cDNA used. In order to 
examine the relative expression of the KLK15 gene in normal compared witii malignant tissues, 29 pairs of 

3 0 prostatic tissues were examined. Each pair represented normal and cancerous tissue obtained from the same 

patient. The results are summarized in Table 2. Thirteen out of 29 patients had significantiy higher KLK15 
expression in the cancer tissue and only three had the expression of KLK15 higher in non-cancer than to cancer 
tissues. Analysis by the McNemar test indicated that the differoices betwera normal and cancerous tissues are 
statistically significant CP=0.021). Because of the small number of cases, the binomial distribution was used 
35 to compute the significance level. The prostate cancer patients were further classified into two groups: (a) 
KLK15 expression-positive (N= 21) and (b) KLK15 expression-negative (or w&cy low) (N= 8). When the 
association of KLKI5 expression was compared witii clmicopathological prognostic variables higher KLK15 
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expression was found to be more frequent in patients with late stage disease and tumours of higho' gi^de (Table 
3). 

Discussion 

Kallikreins are a subgroup of serine proteases. The term *kallikrein* is usually utilized to describe an 
5 enzyme that acts upon a precursor molecule (kininogen) for release of a bioactive peptide (kinin)(3.42). 
However, the generic term 'tissue kallikrein' is not restricted to the functional definition of the enzyme. This 
term is now used to describe a group of enzymes with highly conserved gene aiu! protein structure which also 
co-localize in the same chromosomal locus. Among the three classical human kallikrein genes, only KLKl 
encodes for a protein with potent kininogenase activity. The enzymes racoded by KLK2 and KLK3 genes have 
10 very weak kininogenase activity. Tht already cloned 14 members of the human kallikrein gene family have a 
number of similarities (7,11) as show below: 

• All genes localize to the same chromosomal region (19ql3,3-ql3.4) 

• All genes encode for putative serine proteases with a conserved catalytic triad in the appropriate positions, 
i.e., histidine near the end of the second coding exon, aspartic acid in the middle of the third exon, and serine 

15 at the beginning of the fifth (last) exon. 

• All genes have five coding exons (some members contain one or more 5/- untranslated exons). 

• Coding exon sizes are similar or identical. 

• Intron phases are fully conserved. 

• All genes have significant sequence homologies at the DNA and amino acid levels (30-80%). 
20 • Many of these genes are regulated by steroid hormones. 

Figures 2 and 8 show that the newly identified KLKIS gene shares all the above similarities and is 
thus a new member of the human kallikrein multigene family. This gene was named KLK15. 

Many kallikrein genes are related to the pathogenesis of human diseases, depending on the tissue of 
their primary expression. The KLKl gene is involved in many disease processes, including inflammation (3), 

2 5 hypertension (44), renal nephritis and diabetic renal disease (45,46), The connections of HSCCE (KLK7) with 

skin diseases, including pathological keratinization and psoriasis, have ahready been reported (47,48). Little 
et al. suggested that zyme (KLK6) may be amyloidogenic and may play a role in the development of 
Alzheimer's disease (14). There are other reports describing cormection of neuropsin (KLKS) expression with 
diseases of the central nervous system, including epilepsy (49.50). Being primarily expressed in the thyroid, 

3 0 KLK15 may play an important role in the normal physiology and pathophysiology of this gland. Among all 

other discovered kallikreins, many are expressed in the thyroid but none at highest levels in this tissue (7,11) 
The KLK15 gene is up-regulated, at the mRNA level, in a subset of prostate cancers. The distributions 
of KLK15 qualitative expression status (high or low) between subgroups of patients differing by disease stage, 
tumor grade and Gleason score indicated that high KLK15 expression was found more frequently in Grade 3 
3 5 tumors as well as in stage in and Gleason score >6 patients. These findings indicate that overexpression of 
KLK15 is associated with more aggressive forms of the disease and may be an indicator of poor prognosis 
(Tables). 
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There is now growing evidence that many kallikreins and kallikrein-like genes are related to 
malignancy. PSA is the best marker for prostate cancer so far (20). Recent reports suggest that hK2 (encoded 
by the KLK2 gene) could be another useful diagnostic marker for prostate cancer (21^1). NESl (KLKIO) 
appears to be a novel tumor suppressor gene (23). The zyme (KLK6) gene was shown to be differentially 
5 expressed in primary breast and ovarian tumors (24), and the human stratum comeum chymotryptic enzynae 
(HSCCE, KLK7) has been shown to be e^ressed at abnormally high levels in ovarian cancer (25). Another 
recently identified kaUikrein-like gene, tentativdy named the tumor-associated differentially expressed gene-14 
(TADG-14)/neuropsin) (KLK8) was found to be overexpressed in about 60% of ovarian cancer tissues (26). 
Pprostase/KLK-Ll/ (KLK4), another newly discovered kallikrein-like gene, is speculated to be linked to 

1 0 prostate cancer (13). Two newly discovered kallikreins, KLK-L4 (KLK13) and KLK-L5 (KLK12). were also 
found to be downregulated in breast cancer (10). Thus, extensive new literature suggests multiple connections 
of various kallikrein genes to many forms of human cancer. 

The existence of multiple alternatively spliced mRNA forms is frequent among the kallikreins. 
Distinct RNA species are transcribed from the PSA gene, in addition to the major 1 .6 kb transcript (19,52.53). 

15 Also, Reignoan et al reported the identification of two alternatively spliced forms of the human glandular 
kallikrein 2 (KLK2) gene (54). A novel transcript of the tissue kallikrein gene (KLKl) was also isolated from 
the colon (55). Neuropsin, a recently identified kallkrein-like g^ie, was found to have two alternatively spliced 
forms, in addition to the major form (26,56). KLK-L4 was also found to have different alternatively spliced 
forms (10). Because the splice variants of KLK15 have an identical 5* sequence required for translation, 

2 0 secretion and activation, it is possible to assume that they encode for a secreted protein (53). 

In conclusion, a new member of the human kallikrein gene family, KLK15, has been characterized 
which maps to the human kallikrein locus (chromosome i9ql3.3-ql3.4). This gene has three related splice 
forms in addition to the classical form. KLK15 is expressed in a variety of tissues but predominantiy in the 
thyroid, it appears to be up-regulated in more aggressive forms of prostate cancer and its expression is 

2 5 influenced by steroid hormones. Since a few other kallikrems are already used as valuable tumor markers, 

KLKI5 may also find similar clinical applications. 

Having illustrated and described the principles of the invention in a pref^red embodnnent, it should 
be appreciated to those skilled in the art that the invention can be modified in arrangement and detail without 

3 0 departure from such principles. All modifications coming within the scope of the following claims are claimed. 

All publications, patents and patent applications referred to herein are incorporated by reference in 
their entirety to the same extent as if each individual publication, patent or patent application was specifically 
and individually indicated to be incorporated by reference in its entirety. 
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Table 1. Primers used for genomic PGR amplification. 



Gene 


Primer name and sequence 


GenBank 
accession # 


KLKl 


KliKl-A: ATC CCT CCA TTC CCA TCT TT 
KLKl-B CAC ATA CAA TTC TCT GGT TC 


L10038 


KLK2 


KLK2-A: AGT GAC ACT GTC TCA GAA TT 
KIjK2-B: CCC CAA TCT CAC GAG TGC AC 


M18157 


PSA 


E5-A: GTC GGC TCT GGA GAC ATT TC 
E5-B: AAC TGG GGA GGC TTG AGT C 


M27274 


KLK15 


KLK15-P1 CTC CTT CCT GCT GGC ATC CA 
KLK15-R1 ATC ACA CGG GTG GTC ATG TG 
KLK15-F2 CAA GTG GCT CTC TAG GAG CG 
KLK15-R2 GAG ACC AGG CTT GGT GGT GT 


AF242195 



* all primers are presented in 5' — ► 3' direction. 
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Table 2. KLK15 expression in 29 pairs of cancerous and non-cancerous prostatic tissues. 


KLK15 Expression 


Number of patients P value * 


Higher in cancer vs. nonnal 


13 


Lower in cancer vs. normal 


3 


High expression but approx. equal in both 


8 


tissues 




Low (or no) expression but approx. equal in 


5 0.021 


both tissues 





* P value was calculated by the McNemar test using the binomial distribution. 
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Table 3, Relationship between KLK15 expression and other clinicopathological 
vanables in 29 patients with primary prostate cancer 



Variable 



Stage 
MI 

m 

Grade 
Gl/2 
G3 

Gleason score 
<6 
>6 

Unknown 



Patients 



No. of p atients (%) 
KLK15 negative KLKl 5 positive 



P 

value* 



20 


8(40) 


12 (60) 




9 


0(0) 


9 (100) 


0.033 


23 


8 (34.8) 


15 (65.2) 




6 


0(0) 


6(100) 


0.15 


22 


7 (31.8) 


15 (62.2) 




6 
1 


0(0) 


6 (100) 


0.14 



*Fisher*s Exact Test. 
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We Claim : 

1. An isolated KLK15 nucleic acid molecule of at least 30 nucleotides which hybridizes to one or more 
of SEQ, m. NO. 1 through 5 or 10 through 24, or the complement of one or more of SEQ ID NO. 1 

5 through 5 or 10 through 24 under stringent hybridization conditions. 

2. An isolated nucleic acid molecule which comprises: 

(i) a nucleic acid sequence encoding a protein having substantial sequence identity with an amino 
acid sequence of SEQ. ID. NO. 6, 7, 8, or 9; 

(ii) a nucleic acid sequence encoding a proteui comprising an amino add sequence of SEQ. ID. 
10 NO. 6, 7, 8, or 9; 

(iii) nucleic acid sequences complementary to (i) or (ii); 

(iv) a degenerate form of a nucleic acid sequence of (i) or (ii); 

(v) a nucleic acid sequence capable of hybridizing under stringent conditions to a nucleic acid 
sequence in (i), (ii) or (iii); 

15 (vi) a nucleic add sequence encoding a truncation, an analog, an allelic or species variation of 

a protein comprising an amino acid sequence of SEQ. ID. NO. 6, 7, 8, or 9; or 
(vii) a fragment, or allelic or species variation of (i), (ii) or (iii). 

3. An isolated nucleic add molecule which conoprises: 

(i) a nucldc acid sequence comprising the sequ^ce of one or more of SEQ.ID J^O. 1 through 
20 5 or 10 through 24, wherein T can also be U; 

(ii) nucleic acid sequences complementary to (i), preferably complementary to the full nucleic 
acid sequence of one or more of SEQ.BD.NO. 1 through 5 or 10 through 24; 

(iii) a nucleic acid capable of hybridizing under stringent conditions to a nucleic acid of (i) or (ii) 
and preferably having at least 18 nucleotides; or 

2 5 (iv) a nucleic acid niolecule differing from any of the nucleic acids of (i) to (iii) in codon 

sequences due to the degeneracy of the genetic code. 

4. A vector comprising a nucleic acid molecule of any of the preceding claims. 

5. A host cell comprising a nucleic add molecule of any of the preceding claims. 

6. An isolated protein comprising an amino acid sequence of SEQ. ID. NO. 6, 7. 8, or 9. 
30 7. A method for preparing a protein as claimed in claim 6 comprising: 

(a) transferring a vector as claimed in claim 4 into a host cell; 

(b) selecting transformed host cells from untransformed host cells; 

(c) culturing a selected transformed host cell under conditions which allow expression of the protein; 
and 

3 5 (d) isolating the protein. 

8. A protein prepared in accordance with the method of claim 7 . 

9. An antibody having specificity against an epitope of a protein as claimed in claim 6. 
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10. An antibody as clakned in claim 9 labeled with a detectable substance and used to detect the polypeptide 
in biological samples, tissues, and cells. 

11. A probe comprising a sequence encoding a protein as claimed in claim 6, or a part thereof, 

12. A method of diagnosing and monitoring a condition associated with a protein as claimed in claim 6 by 
5 determining the presence of a nucleic acid molecule as claimed in any of the preceding claims or a protein 

as claimed in any of the preceding claims. 

13. A method as claimed in claim 12 wherein the condition is cancer. 

14. A method for identifying a substance which associates with a protein as claimed in claim 6 comprising 

(a) reacting the protein with at least one substance which potentially can associate with the protein, 
1 0 under conditions which permit the association between the substance and protein, and 

(b) ' removing or detecting protein associated with the substance, wherein detection of associated 
protein and substance indicates the substance associates with the protein. 

15. A method for evaluating a compound for its ability to modulate the biological activity of a protein as 
claimed in claim 6 comprising providing the protein with a substance which associates with the protein 

1 5 and a test compound under conditions which permit the formation of complexes between the substance 

and protein, and r^noving and/or detecting complexes. 

16. A method for identifying inhibitors of a KLK15 Related Protein interaction, comprising 

(a) providing a reaction mixture including tiie KLK15 Related Protein and a substance that binds 
to the KLK15 Related Protein, or at least a portion of each which interact; 
20 (b) contacting the reaction mixture with one or more test compounds; 

(c) identifying compounds which inhibit the interaction of the KLK15 Related Protein and 
substance. 

17. A method for detecting a nucleic acid molecule encoding a protein comprising an amino acid sequence 
of S£Q. ID. NO. 6, 7, 8, or 9 in a biological sample comprising the steps of: 

25 (a) hybridizing a nucleic acid molecule of claim 1 to nucleic acids of the biological sample, 

thereby forming a hybridization complex; and 
(b) detecting the hybridization comply wherein the presence of the hybridization complex 
correlates with the presence of a nucleic acid molecule encoding the protein in the 
biological sample. 

30 18. A method as claimed in claim 17 wherein nucleic acids of the biological sample are amplified by the 
polymerase chain reaction prior to the hybridizing step. 
19. A method for monitoring the progression of cancer in an individual, comprising: 

(a) contacting an amount of an antibody which binds to a KLK15 Related Protein, with a sample from 
the individual so as to form a binary complex comprising the antibody and KLK15 Related Protein 

35 in the sanq)le; 

(b) determining or detecting the presence or amount of complex formation in the sample; 

(c) repeating steps (a) and (b) at a point later in time; and 
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(d) comparing the result of step (b) with the result of step (c), wherein a difference in the amount of 
complex formation is indicative of the progression of cancer in said individual. 
20. A method for treating a condition mediated by a protein as claimed in claim 6 comprising administering 
an effective amount of a compound identified in accordance with a method claimed in claim 15 or 16. 
5 21 . A method as claimed in claim 20 wherein the condition is cancer. 

22. A composition comprising one or more of a nucleic acid molecule or protein claimed in any of the 
preceding claims, or a substance or compound identified using a method as claimed in any of the 
preceding claims, and a phannaceutically acceptable carrier, excipient or diluent. 

23. Use of one or more of a nucleic acid molecule or protein claimed in any of the preceding claims, or a 
10 substance or compound identified using a method as claimed in any of the preceding claims in the 

preparation of a pharmaceutical composition for treating a condition mediated by a protein as claimed in 
claim 6, or a nucleic acid molecule as claimed in claim 1. 

24. A method of conducting a drug discovery business comprising: 

(a) providing one or more assay systems for identifying agents by their ability to inhibit or 
15 potentiate the interaction of a KLKIS Related Protein and a substance that binds to the 

KLK15 Related Protein; 

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, 
for efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical prqjaration including one or more agents idratified in step (b) 
20 as having an acceptable therapeutic profile. 

25. A vaccine for stimulating or enhancing in a subject to whom the vaccine is administered production of 
antibodies directed against a protein as claimed in claim 6. 

26. A method for stimulating or enhancing in a subject production of antibodies directed against a protein as 
claimed in claim 6. 

25 27. A method as claimed in claim 26 comprising administering to the subject a vaccine as claimed in claim 
25 in a dose effective for stimulating or enhancing production of the antibodies. 
28. A method for tzeating, preventing, or delaying recurrence of cancer comprising administering to the subject 
a vaccine as claimed m claim 25 in a dose effective for treating, preventing, or delaying recurrence of 
cancer. 

30 
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Figure 1 

TGGATTCTCTCACTCCCTCCCCAGACTGCAGCCGAACCCTGGTCCCTCCTCCACA 
(^^TGG CTT CTC CTC ACT CTC TCC TTC CTG CT6 GCA TCC ACA 
M WLLLTLSFLLAST 

G gtgaggtggccccaggagggggcGaggtctgtgggagcaggtg 

...Intronl-. « «.gcatcctctaccccttctcttag CA GCC CAG 

A A Q 

GAT GGT GAC AAG TTG CTG GAA GGT GAC GAG TGT GCA CCC CAC 

D GDKLLEGDECAPH 
TCC CAG CCA TGG CAA GTG GCT CTC TAC GAG CGT GGA CGC TTT 

SQPWQVALYER6RF. 
AAC TGT GGC GCT TCC CTC ATC TCC CCA CAC TGG GTG CTG TCT 

NCGASLISPHWVLS 
GCG GCC CAC TGC CAA AGC CG £tatgaaggcaggggctcagggtcctga 

A A [h1 C Q S R 

ggga ^Intron 2 cgcactccactggcgggaaa 

accactcgcccgcacag C TTC ATG AGA GTG CGC CTG GGA GAG CAC 

FMRVRLGEH 
AAC CTG CGC AAG CGC GAT GGC CCA GAG CAA CTA CGG ACC ACG 

NLRKRDGPEQZiRTT 

tct cgg gtc att cca cac ccg cgc tac gaa gcg cgc agc cac 
srviphpryearsh 

cgc aac gac atc atg ttg ctg cgc cta gtc cag ccc gca cgc 
rnHimllrlvqpar 

ctg aac ccc cag gtg cgc ccc gcg gtg cta ccc acg cgt tgc 

LNPQVRPAVIiPTRC 
CCC CAC CCG GGG GAG GCC TGT GTG GTG TCT GGC TGG GGC CTG 

PHPGEACVVSGWGIi 
GTG TCC CAC AAC GAG CCT GGG ACC GCT GGG AGC CCC CGG TCA 

VSHNEPGTAGSPRS 
CAA G g t gcgtgaaagga t ggagc tgga t .Intron 3 

Q 

ctccaagtccactgtcttccccag TG A6T CTC CCA GAT ACG TTG CAT 

VSLPDTLH 
TGT GCC AAC ATC AGC ATT ATC TCG GAC ACA TCT TGT GAC AAG 

CANISIISDTSCDK 
AGC TAC CCA GGG CGC CTG ACA AAC ACC ATG GTG TGT GCA GGC 

SVPGRLTNTMVCAQ 
GCG GAG GGC AGA GGC GCA GAA TCC TGT GAG £tcagagcctagagg 

AEGRGAESCE 

ggccatcaggcggaagaagaggg Intron 4 cct 

gagaccccctcttttccccacag GGT GAC TCT GGG GGA CCC CTG GTC 

gdIsIggplv 
tgt ggg ggc atc ctg cag ggc att gtg tcc tgg ggt gac gtc 
cggilqgivswgdv 
cct tgt gac aac acc acc aag cct ggt gtc tat acc aaa gtc 

PCDNTTKPGVYTKV 
TGC CAC TAC TTG GAG TGG ATC AGG GAA ACC ATG AAG AGG AAC 
CHYLEWI RETMKRN 

^G^TATTCTAGCCTATCTCCTGTGCCCCTGACTGAGCAGAAGCCCCCACAGCTGGCCAGCAGCCC 
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Figure 1 Cont'd 



CGCCTGACATGGAACAGAACGGAGCCATCCCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCCAAGG 

ACTTGTCGCACCTGAGGACAAAGCTGGCGCTCAAGGTCACCTGTTTAATGCCAAGATAACAAAGCGC 

TGATCCAAGTTGCTCTGTAGGAATTTCTGTGACTTTTTTCTGGGGTCAAAGAGAAACCCCGAGACAC 

TGTACACTGTTCCTTTTCACCCACCACCCCGATCCCTAGGTGAGGAGAAGCGGCTTGAAGCAGGGCT 

CCATTCATTCAACACACATGACCACCCGTGTGATCTTGAACAAGAGGCCCAATCTCACTTCGCCTTG 

GTTTCCTTATCTGTAAAATGAGACCATCTTATTGCTGACTTCAAAGGGCTGTTGTC 

AGATGATTCGTCTGAACTGATTAAAATCGTGTCTGGCACTGA 
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Figure 2 
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Figure 3 
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Figure 4 
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Figure 6 
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Figure 8 
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SEQUENCE LISTING 
SEQ ID NO 1 

KLK15 full genomic structure 

1 agaatgggtg ctgtgggatt caggggagac acctgttagg tgttggggcc tcccagaaga 
61 ggtgggggca gagtgtcaga ggacaaagat gaatttggaa gatatgggga agaaggattt 
121 caattcaccc tcaaagcttc ctgaggcctc ccgtgggtcg ggccctgcag tactggagac 
181 ccagagtgga gtcagaccag ctcctcgggg agctgccagt ctcgtagggg aggcagacac 
241 cactgagggt caggggaggt cagagaaggc ctcaaggagg aagcggggct ggaagggaat 
301 ggcgttggat atgcggtggg aggaatagcc taagcatgaa atggcaggag ggaaaatggc 
361 agcactggct gcgtctagga caaggtcatg ggagacccag ggagaggggc tggaagggaa 
421 gaagccactt ttgtccttga aagtgaggct ggagccaggc aactcatgcc tgtaatccca 
481 gcactttggg aggctgaggc gggtggatca ctagaggtca ggagttcaag accagcctgg 
541 ccaacatggt gaaactccgt ctctactaaa attacaaaaa ttagctgggc gtggtggcac 
601 acacctgtaa tcccaattgc ttgggaggct gaggcaggag aatctcttga acccagaagg 
661 cagaggttac agtgagcgga gatcacgcca ctccactcca acctgggcta cagagccaga 
721 ctccgtctca aaaaaaaaaa aaaaaaagaa aaaaaaagaa agaaagtgaa tttgaagagc 
781 tggactttat cctggtggtg ccaaggatcc atggagggtg gtgagcaggg gaggggcaca 
841 gccagctcca gatgtagaaa gaccctttgg ggtcatggct ggagggcaag ctggtggagg 
901 ggactggact ggagggggac ccaaaaggcc agataagagg gttgagatag accaggcgcg 
961 gtggctcatg cctgtaatcc cagcactttg ggaggccgag gtgggtggat catgaagtca 
1021 agagattgag gccatcctgg ctaacacggt gaaaccctgt ctctacttaa aaaaaaaaaa 
1081 tttccaaaaa attagccggg cacggtggtg ggcgcctgta gtcccagcta ctcgggaggc 
1141 tgaggcggga gaatggtgtg aacctgggag gtggagcttg cagtgagccg acattgtgcc 
1201 actgcactcc agcctgggtg acagagtgag actccgtctc aaaaaaataa aaaaagttgg 
1261 gacagggggt ccttgcgtga tgatggagag agatccaccc gctggtagca tggtgctgga 
1321 ggctgacagg tggaggaggt ggggcagggt ctgtccgagt gcctagagga agagtaaacc 
1381 ttccagagat gggggaccca gaaggaagcg cagagtgggg ttgggggaag gggataccgg 
1441 tggtcagaag aaatttatta acagtggatg ggataagtct gtgtctggag ggatcctggt 
1501 ggaggcagaa gggtcctgcc tcacctggat tctctcactc cctccccaga ctgcagccga 
1561 accctggtcc ctcctccaca atgtggcttc tcctcactct ctccttcctg ctggcatcca 
1621 caggtgaggt ggccccagga gggggccagg tctgtgggag caggtgcccc cttcccaagc 
1681 atgtctgggc ccagtgatct gccagcccct acctcaccca gagaccacta aagatccttc 
1741 cttcaccctc cacctgtgcc aatgtcccta agcccttacc gtcaggtgct ggtgctgctg 
1801 ctctggagtc gctatgttgc ctggggcctc tcgctgccca cgacaaggaa cacggtcctg 
1861 gggttacaca aacctgagct gagtcctggg gcaaccgctt ccttgctgtg tgtccttgag 
1921 ggaactgctt cacctctctg ggcttcgaat gccttctcta taagacagca cccacttgag 
1981 acaataacag tgaggtctca atagcataac agaggtaata tacatagcaa gcattagaca 
2041 agtgctgaga ggccaacagc acagacagac tccagcttga gtcccacacc tgccactccc 
2101 tgtctcttac agggtctttg aggggattaa atgtggttgt gtgtgaggca gaagcataag 
2161 cctggcccag gtagtgcccc ttcaggtgtg caagccaggc acggtgctta gagcttacat 
2221 acaacgtcta tgtgtggtgg gcaccaccga cctcatttga caagggaagg ggctgtggct 
2281 cagagggacg gccacaacat caaggtcacc ttgggtgtca ggcaaactcc agattgaact 
2341 cagctgccac acaccaagaa attaattgta acctgatgcc tctcttctgg agaaattggg 
2401 gggtggactt tcattaacgt tctgccacaa atgaccctca ctcctggggg cccctgagac 
2461 ccccacgcct ccagcctccc ctccggctct ctctgtgcac tcacctacct gcctcgcgcc 
2521 tgcctgctgc gcccagctgg ggcctccacc ttcctctggc ttggactggc caggtgcagc 
2581 ctcggtgccc agctgttcag cccgtaccct ccgcccttcg gaggacgacc tcacccttcc 
2641 tttgttaagc cccttgtcca ccacatccgc attcccctgg tctcacgggg gcctttggcc 
2701 cagttcctga ctgtgatggg gagagtgtgg gcatttggtc tggctgtgca aatcctgccc 
2761 ctgtgtgggt gggagtgtgc atggcttcaa ccttcagggg atgcatccac attgcccagt 
2821 ggagaggggt cctggtcctg tgaccttgaa tgtctctaat catgtcctta agcataatgc 
2881 cattctgtgt gtgtgtgtgt gtgtgtgtgt gtacatgcac gtgtgcagtg ggtatacaag 
2941 gccctgtatg ttcacatcct ctccacatgc atgagccaga tccccatatg tgaaacccaa 
3 001 tcagtgactc cacagatctg gcttgggggc tgatctagag atggataaat atgtcctgcc 
3 061 ctggctgcct ctggcttcag ctgcatgtct ttgaccttga atgcccagcc ccgtgtctgg 
3121 gtgctgcccc agacagcaag tccacatctg agtgttggcc ttctgggttg gtgtctgcag 
3181 ctctaactct acaaaatgtc ttgtgggtga atcacggttt taaccttgac ttttttttgt 
3241 ttgtttggtt ttttttgaga cggagtctcg ctctgccgcc caagctggag ttcagtggtg 
3301 caacctcagc tcactgcaac ctccgcctcc caggttcaag caattctgtc tctgcctccc 
3361 gagtagctag aattacaggc acgcaccacc acgcccagct gatttttgta tttttattta 
3421 tttatttatt tattttttag tagagacggg atttcacgat gttggccagg ctggtctcaa 
3481 actcctgacc tcaggtgatc cacccacctc ggccttggcc tcccaaagtg ctgggattac 
3541 aggcgtgagc caccacacct ggccaacctt gactatttat tataggtaat tctgtgcaga 
3601 tgtctgactt atgttggcca tctccaggat ggacctgaac tttcacacgt atgtccctgt 
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3661 gactaaatcc aggtgtcatt tgcaaaaaac 
3721 aggtatcact caccatacat acacacatgc 
3781 cttacaacaa tcttcatttt acagatgagg 
3841 actcaaagtt tcacagctag tacattcgaa 
3901 ccctgtatgc aagtgtctgt gacactggat 
3961 aggtctgaac aatatccaat tctgtgtgtg 
4021 tattcatgtc ttaaccatcc atattcatat 
4081 tttttttttt tttttttttt tttgagatgg 
4141 caatggagca acctccgctc actgcaatct 
4201 ctcagcctcc agagtagctg ggattacagg 
4261 tatttttgtt agagacaggg tttccccata 
4321 caggtgatcc acccgcctcg gcctcccaaa 
4381 cccagcctgt gctgtgattc ttgaagctgc 
4441 tccagtcctg tccatagctg tacctaagtg 
4501 atgaccttgt atagccacat ctgggactca 
4561 agactctggg gccaaggctg ggtcacacat 
4621 gacaatttgg tgaccgtgaa tgactggttt 
4681 gtgcccctgt ctccaccccc aaccacagag 
4741 ctctctctcc cagagtctta tagcaaatgg 
4801 cagcggttgt aaataaacaa cagggcaggc 
4861 gcactttggg aggctgaggc gggcagagca 
4921 taacatggtg agacctcgtc tctactaaaa 
4981 cacctcagct actcgggagg ctgaggcagg 
5041 gcagtgagct gagatcatgc cactgcactc 
5101 caaaaacaac aacaacaaca aaacaaaaaa 
5161 aaggcatagg catatagtag ttagggcagg 
5221 cgtccctgtc ctcaggcatc ctctacccct 
5281 gctggaaggt gacgagtgtg caccccactc 
5341 tggacgcttt aactgtggcg cttccctcat 
5401 ctgccaaagc cggtatgaag gcaggggctc 
5461 agagctccta gatttggggg aagacggagg 
5521 cgaggaggcc ggatgtcaag cccctgggtt 
5581 tctctgaagg aggaaggaga agactagttc 
5641 agaatcctgg attcggggac agaccaggag 
5701 tctaggagtg tgcctgactt cagactcgtt 
5761 ggcttcaggg tcttgggaaa aggtaatggg 
5821 tcgggttgcc cactctttga tctttctgtc 
5881 tccactggcg ggaaaaccac tcgcccgcac 
5941 aacctgcgca agcgcgatgg cccagagcaa 
6001 ccgcgctacg aagcgcgcag ccaccgcaac 
6061 gcacgcctga acccccaggt gcgccccgcg 
6121 gaggcctgtg tggtgtctgg ctggggcctg 
6181 agcccccggt cacaaggtgc gtgaaaggat 
6241 atgctccagg gctcttgggc ggaggggaca 
6301 ccactgtctt ccccagtgag tctcccagat 
6361 tcggacacat cttgtgacaa gagctaccca 
6421 ggcgcggagg gcagaggcgc agaatcctgt 
6481 gaagaagagg gatggggaca ggtgtgggag 
6541 ggccagagaa gatgctaggg ttaggcttgg 
6601 ggtgaggttg gagttggggt tataggtggg 
6661 tagttagttt gagatggcat gggttggggt 
6721 ggtgggaaat acgtcagggt tgaattggga 
6781 catgaagatt gagattggat tttgagatgg 
6841 ggatgtgggc tgagttggat ttaacttagt 
6901 attggatata ggttgggtga gttgtattga 
6961 tgggttggct ctgtttggga taaactgggc 
7021 tgggatgggg atggattggg tttggggtga 
7081 atccaggagg tttcactcaa cctgagaccc 
7141 acccctggtc tgtgggggca tcctgcaggg 
7201 caacaccacc aagcctggtg tctataccaa 
7261 aaccatgaag aggaactgac tattctagcc 
7321 ccccacagct ggccagcagc cccgcctgac 
7381 cctgtccaag gcccagatgt tagccaagga 
7441 caaggtcacc tgtttaatgc caagataaca 
7501 ttctgtgact tttttctggg gtcaaagaga 
7561 cacccaccac cccgatccct aggtgaggag 
7621 aacacacatg accacccgtg tgatcttgaa 



aactaatatt attaagtagc taccagggct 
acacacacac atacacattc ctacctcatc 
aaacagaggc acagacaggt cgaataactt 
cccaggctta aggacccatc tttgtccaga 
gccaagactc acactagaga tgttgaattt 
tgtttgtgtg tgcatgtgtg tgtgtgtatg 
acacatatga acatctgtgc tgtgattctt 
agtttcactc ttgtcaccca ggctggagtg 
ccgcctcccg ggttcaagcg attttcctgc 
cacccgccac catgcccagc taattttttg 
ttggccaggc tggtctcgaa ctcctgacct 
gtgctgggat tacaggcatg agccaccgtg 
aacccatgtg catgcaagtg aatttcagct 
tggaagctgg atgtgcatgt atgcatgtcc 
tactgcacac tgaatttggc tgacatgtcc 
actgagtggc cacatgcgtt tgacgtctgt 
caagtgacca cctgtctgaa cctgtatcca 
gacttcttgc cctctggtct gttccccttc 
ggtgggggct agagttctgg agaaaacagg 
ggagcatggt ggctcacacc tgtaatccca 
tttgaagtca gaagtttgag actacctggc 
atacaaaaat tagccaggtg tggtggcggg 
aggatcactt gaacccagga ggcggaagtt 
cagcctgggc aaaagagtga gactccgtct 
cagggcaggg tgtcttgaga agttagggga 
gtgcaaggaa ggtgtaggag gcaatgtaaa 
tctcttagca gcccaggatg gtgacaagtt 
ccagccatgg caagtggctc tctacgagcg 
ctccccacac tgggtgctgt ctgcggccca 
agggtcctga gggagcctgg ttcgggggga 
cagacgccag aactcctggg ttctgaaaga 
aggaaggagt gtgtgtttca aagccttcga 
cagcttttga gcctcagttc tagggatgtg 
ggggctggga gtagttggag gggatcgagt 
ggtccttgag gagcaggggc tggaaccatt 
atgtcgagat ttctaaaggg tcgggagacc 
ctctacttgc gggtaaccac tggcccgcac 
agcttcatga gagtgcgcct gggagagcac 
ctacggacca cgtctcgggt cattccacac 
gacatcatgt tgctgcgcct agtccagccc 
gtgctaccca cgcgttgccc ccacccgggg 
gtgtcccaca acgagcctgg gaccgctggg 
ggagctggat gcgaggcctc aaggaatcct 
agggccggaa tttatggatc tgctccaagt 
acgttgcatt gtgccaacat cagcattatc 
gggcgcctga caaacaccat ggtgtgtgca 
gaggtcagag cctagagggg ccatcaggcg 
tccggatggg gttggatttt ctttgctttg 
agatggagta ggaagagaag ttagaatagg 
gattgcgttg tttgaggtgg ataactgtga 
tgagaatggg aatggtttgg tttgattctg 
tgaggtagat tttgtttgga atgcagaaga 
gcatgggttt gatttgattt tgaatggtga 
acagttgcac tggagttgca tgggggtgag 
gctgtgttga attggggttg gggttggggt 
tgtattgagt tgagttgggt tggggttccc 
gattgcaaat ggtgattagg atgaggatga 
cctcttttcc ccacagggtg actctggggg 
cattgtgtcc tggggtgacg tcccttgtga 
agtctgccac tacttggagt ggatcaggga 
tatctcctgt gcccctgact gagcagaagc 
atggaacaga acggagccat cccccaagac 
cttgtcccac ctgaggacaa agctggcgct 
aagcgctgat ccaagttgct ctgtaggaat 
aaccccgaga cactgtacac tgttcctttt 
aagcggcttg aagcagggct ccattcattc 
caagaggccc aatctcactt cgccttggtt 
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7681 tccttatctg taaaatgaga ccatcttatt gctgacttca aagggctgtt gtgaggatta 
7741 aatgagatga ttcgtctgaa ctgattaaaa tcgtgtctgg cactgagtaa ataccctcta 
7801 tctctggatc ccagttaaag gacctaacag acactagatt accaagaatg gctttttctt 
7861 taaggtttag ttctgggccg ggcatggtgg ctcacacctg taatcccagc actttgggag 
7921 gccaaggcgg gcggctcact tgaggtcagg agtgcaagac cagcctggcc aacatggtga 
7981 aaccccatct ctactaaaaa tactaaaaaa atttagccgg gcgtggtggc acacgactgt 
8041 aatcctagct acttgggagg gtgatgtggg aggatcgctt gaacttagga ggcaggagtt 
8101 gcagtgagcc gagatcgcgc cactgcactc cagcctggtg acagagcaag actccatctc 
8161 agaaaaaaaa aaaaaaaaaa aaagatttag ttctgggctt cctggtagcc atggcaaaaa 
8221 ggcaaatact gtcctttcct tagccaggtc cctgatatac agcagaggct ggaactctga 
8281 gctgctttga ttttaccaaa aagccaagac aacctgttgg aagcctatgg gtttaccatt 
8341 gaggctgcag gaatctagtt cctaattatc ttcagagacc acaaaatgtg atgttcaagg 
8401 tcgctgaatg ttgaagtaca tgaacctggc tcgtgagacc taaatattgt actggtggtg 
8461 ggggggaagg gtcattggaa tctgtggtta gcctgatctt gacctgcgag ggaaggttgt 
8521 ccagatctct ggactttgga ggaccgacgt tgagcaccat aatgggagca gaagtgcgag 
8581 gtctttgaga ccccgcttgt tggggcggcg ccggatttgg atgctaaaaa ttacctggga 
8641 accctgaata catctgggtt gggcgcacaa tgtgtggctc cccacacatc tttaggaaca 
8701 catttgggca acccggtggg agtgaacggc ctggc 



SEQ ID NO 2 

Classic mRNA {1581. .1623, 5259. .5412, 5913. .6196, 6317, .6453) 



ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAGCAGCCCAGGATGGTGACJUVGTT6CTGGAAGGTG 

ACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTACGAGCGTGGACGCTTTAACTGTGGCGCTTCCCT 

CATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGCCGCTTCATGAGAGTGCGCCTGGGAGAGCACAAC 

CTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGGGTCATTCCACACCCGCGCTACGAAGCGCGCA 

GCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCACGCCTGAACCCCCAGGTGCGCCCCGCGGTGCT 

ACCCACGCGTTGCCCCCACCCGGGGGAGGCCTGTGTGGTGTCTGGCTGGGGCCTGGTGTCCCACAACGAGCCTGGG 

ACCGCTGGGAGCCCCCGGTCACAAGTGAGTCTCCCAGATACGTTGCATTGTGCCAACATCAGCATTATCTCGGACA 

CATCTTGTGACAAGAGCTACCCAGGGCGCCTGACAAACACCATGGTGTGTGCAGGCGCGGAGGGCAGAGGCGCAGA 

ATCCTGTGAGGGTGACTCTGGGGGACCCCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTGTCCTGGGGTGACGTC 

CCTTGTGACAACACCACCAAGCCTGGTGTCTATACCAAAGTCTGCCACTACTTGGAGTGGATCAGGGAAACCATGA 

AGAGGAACTGACTATTCTAGCCTATCTCCTGTGCCCCTGACTGAGCAGAAGCCCCCACAGCTGGCCAGCAGCCCCG 

CCTGACATGGAACAGAACGGAGCCATCCCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCCAAGGACTTGTCCCAC 

CTGAGGACAAAGCTGGCGCTCAAGGTCACCTGTTTAATGCCAAGATAACAAAGCGCTGATCCAAGTTGCTCTGTAG 

GAATTTCTGTGACTTTTTTCTGGGGTCAAAGAGAAACCCCGAGACACTGTACACTGTTCCTTTTCACCCACCACCC 

CGATCCCTAGGTGAGGAGAAGCGGCTTGAAGCAGGGCTCCATTCATTCAACACACATGACCACCCGTGTGATCTTG 

AACAAGAGGCCCAATCTCACTTCGCCTTGGTTTCCTTATCTGTAAAATGAGACCATCTTATTGCT 

GCTGTTGTGAGGATTAAATGAGATGATTCGTCTGAACTGATTAAAATCGTGTCTGGCACTGA 

SEQ ID NO 3 

KLK 15 mRNA SPLICE VARIANT 1 structure (1581.. 1623, 5259.. 5412, 5913. .6078, 

'6317.. 6453) 

ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAGCAGCCCAGGATGGTGACAAGTTGCTGGAAGGTG 

ACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTACGAGCGTGGACGCTTTAACTGTGGCGCTTCCCT 

CATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGCCGCTTCATGAGAGTGCGCCTGGGAGAGCACAAC 

CTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGGGTCATTCCACACCCGCGCTACGAAGCGCGCA 

GCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCACGCCTGAACCCCCAGTGAGTCTCCCAGATACG 

TTGCATTGTGCCAACATCAGCATTATCTCGGACACATCTTGTGACAAGA6CTACCCAGGGCGCCTGACAAACACCA 

TGGTGTGTGCAGGCGCGGAGGGCAGAGGCGCAGAATCCTGTGAGGGTGACTCTGGGGGACCCCTGGTCTGTGGGGG 

CATCCTGCAGGGCATTGTGTCCTGGGGTGACGTCCCTTGTGACAACACCACCAAGCCTGGTGTCTATACCAAAGTC 

TGCCACTACTTGGAGTGGATCAGGGAAACCATGAAGAGGAACTGACTATTCTAGCCTATCTCCTGTGCCCCTGACT 

6AGCAGAAGCCCCCACAGCTGGCCAGCAGCCCCGCCTGACATGGAACAGAACGGAGCCATCCCCCAAGACCCTGTC 

CAAGGCCCAGATGTTAGCCAAGGACTTGTCCCACCTGAGGACAAAGCTGGCGCTCAAGGTCACCTGTTTAATGCCA 

AGATAACAAAGCGCTGATCCAAGTTGCTCTGTAGGAATTTCTGTGACTTTTTTCTGGGGTCAAAGAGAAACCCCGA 

GACACTGTACACTGTTCCTTTTCACCCACCACCCCGATCCCTAGGTGAGGAGAAGCGGCTTGAAGCAGGGCTCCAT 

TCATTCAACACACATGACCACCCGTGTGATCTTGAACAAGAGGCCCAATCTCACTTCGCCTTGGTTTCCW 

TAAAATGAGACCATCTTATTGCTGACTTCAAAGGGCTGTTGTGAGGATTAAATGAGATGA 



SEQ ID NO 4 
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KLK 15 mRNA SPLICE 2 structure (1581.. 1623, 5259,. 5412, 5913.. 6196, 
7127.. 7786) 

ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAGCAGCCCAGGATGGTCACAAGTTGCTGGAAGGTC 
ACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTACGAGCGTGGACGCTTTAACTGTGGCGCTTCCCT 
CATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGCCGCTTCATGAGAGTGCGCCTGGGAGAGCACAAC 
CTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGGGTCATTCCACACCCGCGCTACGAAGCGCGCA 
GCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCACGCCTGAACCCCCAGGTGCGCCCCGCGGTGCT 
ACCCACGCGTTGCCCCCACCCGGGGGAGGCCTGTGTGGTGTCTGGCTGGGGCCTGGTGTCCCACAACGAGCCTGGG 
ACCGCTGGGAGCCCCCGGTCACAAGGGTGACTCTGGGGGACCCCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTG 
TCCTGGGGTGACGTCCCTTGTGACAACACCACCAAGCCTGGTGTCTATACCAAAGTCTGCCACTACTTGGAGTGGA 
TCAGGGAAACCATGAAGAGGAACTGACTATTCTAGCCTATCTCCTGTGCCCCTGACTGAGCAGAAGCCCCCACAGC 
TGGCCAGCAGCCCCGCCTGACATGGAACAGAACGGAGCCATCCCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCC 
AAGGACTTGTCCCACCTGAGGACAAAGCTGGCGCTCAAGGTCACCTGTTTAATGCCAAGATAACAAAGCGCTGATC 
CAAGTTGCTCTGTAGGAATTTCTGTGACTTTTTTCTGGGGTCAAAGAGAAACCCCGAGACACTGTACACTGTTCCT 
TTTCACCCACCACCCCGATCCCTAGGTGAGGAGAAGCGGCTTGAAGCAGGGCTCCATTCATTCAACACACATGACC 
ACCCGTGTGATCTTGAACAAGAGGCCCAATCTCACTTCGCCTTGGTTTCCTTATCTGTAAAATGAGACCATCTTAT 
TGCTGACTTCAAAGGGCTGTTGTGAGGATTAAATGAGATGA 



SEQ ID NO 5 

KLK 15 mRNA SPLICE 3 structure (1581.. 1623, 5259.. 5412, 5913.. 6078, 
7127.. 7786) 

ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAGCAGCCCAGGATGGTGACAAGTTGCTGGAAGGTG 
ACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTACGAGCGTGGACGCTTTAACTGTGGCGCTTCCCT 
CATCTCCCCACACTGGGTGCTGTCTGCGGCCCACTGCCAAAGCCGCTTCATGAGAGTGCGCCTGGGAGAGCACAAC 
CTGCGCAAGCGCGATGGCCCAGAGCT^CTACGGACCACGTCTCGGGTCATTCCACACCCGCGCTACGAAGCGCGCA 
GCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCACGCCTGAACCCCCAGGGTGACTCTGGGGGACC 
CCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTGTCCTGGGGTGACGTCCCTTGTGACAACACCACCAAGCCTGGT 
GTCTATACCAAAGTCTGCCACTACTTGGAGTGGATCAGGGAAACCATGAAGAGGAACTGACTATTCTAGCCTATCT 
CCTGTGCCCCTGACTGAGCAGAAGCCCCCACAGCTGGCCAGCAGCCCCGCCTGACATGGAACAGAACGGAGCCATC 
CCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCCAAGGACTTGTCCCACCTGAGGACMAGCTGGCGCTCAAGGTC 
ACCTGTTTAATGCCAAGATAACAAAGCGCTGATCCAAGTTGCTCTGTAGGAATTTCTGTGACTTTTTTCTGGGGTC 
AAAGAGAAACCCCGAGACACTGTACACTGTTCCTTTTCACCCACCACCCCGATCCCTAGGTGAGGAGAA6CGGCTT 
GAAGCAGGGCTCCATTCATTCAACACACATGACCACCCGTGTGATCTTGAACAAGAGGCCCAATCTCACTTCGCCT 
TGGTTTCCTTATCTGTAAAATGAGACCATCTTATTGCTGACTTCAAAGGGCTGTTGTGAGGATTAAATGA^ 



SEQ ID NO 6 
KLK15 protein 

MWLLLTLSFLLASTAAQDGDKLLEGDECAPHSQPWQVALYERGRFNCGASLISPHWVLSAAHCQSRFMRV^ 
LRKRDGPEQLRTTSRVIPHPRYEARSHRNDIMLLRLVQPARLNPQVRPAVLPTRCPHPGEACWSGWGLVSHNEP 
TAGSPRSQVSLPDTLHCANISIISDTSCDKSYPGRLTNTMVCAGAEGRGAESCEGDSGGPLVCGGILQGIVSWGDV' 
PCDNTTKPGVYTKVCHYLEWIRETMKRN 



SEQ ID NO 7 

KLK15 splice variant 1 

MWLLLTLSFLLASTAAQDGDKLLEGDECAPHSQPWQVALYERGRFNCGASLISPHWVLSAAHCQSRFMRVRLGEHN 
LRKRDGPEQLRTTSRVIPHPRYEARSHRNDIMLLRLVQPARLNPQ 



SEQ ID NO 8 

KLkl5 splice variant 2 

lOTliLLTLSPLLASTAAQDGDKLLEGDECAPHSQPWQVALYERGRFNCGASLISPHWVLSAAHCQSRFMRVRLG 

LRKRDGPEQLRTTSRVIPHPRYEARSHRNDIMLLRLVQPARLNPQVRPAVLPTRCPHPGEACWSGWGLVSHNEPG 

TAGSPRSQG 
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SEQ ID NO 9 



KLkl5 splice variant 3 

MWLLLTLSFLLASTAAQIXSDKLLEGDECAPHSQPWQVALYERGRFNCGASLISPHVmjSiUmCQSRFM^ 

LRKRDGPEQLRTTSRVIPHPRYEARSHMTOIMLIiRLVQPARIiNPQGD 

VYTKVCHYLEWIRETMKFN 



SEQ ID NO 10 

HNEPGTAG 

SEQ ID NO 11 

5 ' Un trans 1 a t ed 

1-1580 

AGAATGGGTGCTGTGGGATTCAGGGGAGACACCTGTTAGGTGTTGGGGCC 
TCCCAGAAGAGGTGGGGGCAGAGTGTCAGAGGACAAAGATGAATTTGGAA 
GATATGGGGAAGAAGGATTTCAATTCACCCTCAAAGCTTCCTGAGGCCTC 
CCGTGGGTCGGGCCCTGCAGTACTGGAGACCCAGAGTGGAGTCAGACCAG 
CTCCTCGGGGAGCTGCCAGTCTCGTAGGGGAGGCAGACACCACTGAGGGT 
CAGGGGAGGTCAGAGAAGGCCTCAAGGAGGAAGCGGGGCTGGAAGGGAAT 
GGCGTTGGATATGCGGTGGGAGGAATAGCCTAAGCATGAAATGGCAGGAG 
GGAAAATGGCAGCACTGGCTGCGTCTAGGACAAGGTCATGGGAGACCCAG 
GGAGAGGGGCTGGAAGGGAAGAAGCCACTTTTGTCCTTGAAAGTGAGGCT 
GGAGCCAGGCAACTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGC 
GG6TGGATCACTAGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGT 
GAAACTCCGTCTCTACTAAAATTACAAAAATTAGCTGGGCGTGGTGGCAC 
ACACCTGTAATCCCAATTGCTTGGGAGGCTGAGGCAGGAGAATCTCTTGA 
ACCCAGAAGGCAGAGGTTACAGTGAGCGGAGATCACGCCACTCCACTCCA 
ACCTGGGCTACAGAGCCAGACTCCGTCTCAAAAAAAAAAAAAAAAAAGAA 
AAAAAAAGAAAGAAAGTGAATTTGT^GAGCTGGACTTTATCCTGGTGGTG 
CCAAGGATCCATGGAGGGTGGTGAGCAGGGGAGGGGCACAGCCAGCTCCA 
GATGTAGAAAGACCCTTTGGGGTCATGGCTGGAGGGCAAGCTGGTGGAGG 
GGACTGGACTGGAGGGGGACCCAAAAGGCCAGATAAGAGGGTTGAGATAG 
ACCAGGCGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAG 
GTGGGTGGATCATGAAGTCAAGAGATTGAGGCCATCCTGGCTAACACGGT 
GAAACCCTGTCTCTACTTAAAAAAAAAAAATTTCCAAAAAATTAGCCGGG 
CACGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCGGGA 
GAATGGTGTGAACCTGGGAGGTGGAGCTTGCAGTGAGCCGACATTGTGCC 
ACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCCGTCTCAAAAAAATAA 
AAAAAGTTGGGACAGGGGGTCCTTGCGTGATGATGGAGAGAGATCCACCC 
GCTGGTAGCATGGTGCTGGAGGCTGACAGGTGGAGGAGGTGGGGCAGGGT 
CTGTCCGAGTGCCTAGAGGAAGAGTAAACCTTCCAGAGATGGGGGACCCA 
GAAGGAAGCGCAGAGTGGGGTTGGGGGAAGGGGATACCGGTGGTCAGAAG 
AAATTTATTAACAGTGGATGGGATAAGTCTGTGTCTGGAGGGATCCTGGT 
GGAGGCAGAAGGGTCCTGCCTCACCTGGATTCTCTCACTCCCTCCCCAGA 
CTGCAGCCGAACCCTG6TCCCTCCTCCAGA 

SEQ ID NO 12 

Exon 1 

1581-1623 

ATGTGGCTTCTCCTCACTCTCTCCTTCCTGCTGGCATCCACAG 
SEQ ID NO 13 



Intron 1 



% 



wo 02/14485 



PCT/CAOl/01141 



6/12 

1624-5258 

GTGAGGTGGCCCCAGGAGGGGGCCAGGTCTGTGGGAGCAGGTGCCCCCTTCCCAAGCATGTCTGGGCCCAGTGATC 

TGCCAGCCCCTACCTCACCCAGAGACCACTAAAGATCCTTCCTTCACCCTCCACCT6TGCCAATGTCCCTAAGCCC 

TTACCGTCAGGTGCTGGTGCTGCTGCTCTGGAGTCGCTATGTTGCCTGGGGCCTCTCGCTGCCCACGACAAGGAAC 

ACGGTCCTGGGGTTACACAAACCTGAGCTGAGTCCTGGGGCAACCGCTTCCTTGCTGTGTGTCCTTGAGGGAACTG 

CTTCACCTCTCTGGGCTTCGAATGCCTTCTCTATAAGACAGCACCCACTTGAGACAATAACAGTGAGGTCTCAATA 

GCATAACAGAGGTAATATACATAGCAAGCATTAGACAAGTGCTGAGAGGCCAACAGCACAGACAGACTCCAGCTTG 

AGTCCCACACCTGCCACTCCCTGTCTCTTACAGGGTCTTTGAGGGGATTAAATGTGGTTGTGTGTGAGGCAGAAGC 

ATAAGCCTGGCCCAGGTAGTGCCCCTTCAGGTGTGCAAGCCAGGOVCGGTGCTTAGAGCTTACATACAACGTCTAT 

GTGTGGTGGGCACCACCGACCTCATTTGACAAGGGAA6GGGCTGTGGCTCAGAGGGACGGCCACAACATCAAGGTC 

ACCTTGGGTGTCAGGCAAACTCCAGATTGAACTCAGCTGCCACACACCAAGAAATTAATTGTAACCTGATGCCTCT 

CTTCTGGAGAAATTGGGGGGTGGACTTTCATTAACGTTCTGCCACAAATGACCCTCACTCCTGGGGGCCCCTGAGA 

CCCCCACGCCTCCAGCCTCCCCTCCGGCTCTCTCTGTGCACTCACCTACCTGCCTCGCGCCTGCCTGCTGCGCCCA 

GCTGGGGCCTCCACCTTCCTCTGGCTTGGACTGGCCAGGTGCAGCCTCGGTGCCCAGCTGTTCAGCCCGTACCCTC 

CGCCCTTCGGAGGACGACCTCACCCTTCCTTTGTTAAGCCCCTTGTCCACCACATCCGCATTCCCCTGGTCTCACG 

GGGGCCTTTGGCCCAGTTCCTGACTOTGATGGGGAGAGTGTGGGCATTTGGTCTGGCTGTGC^^ 

TGTGGGTGGGAGTGTGCATGGCTTCAACCTTCAGGGGATGCATCCACATTGCCCAGTGGAGAGGGGTCCTGGTCCT 

GTGACCTTGAATGTCTCTAATCATGTCCTTAAGCATAATGCCATTCTGTGTGTGTGTGTGTGTGTGTGTGTGTACA 

TGCACGTGTGCAGTGGGTATACAAGGCCCTGTATGTTCACATCCTCTCCACATGCATGAGCCAGATCCCCATATGT 

GAAACCCAATCAGTGACTCCACAGATCTGGCTTGGGGGCTGATCTAGAGATGGATAAATATGTCCTGCCCTGGCTG 

CCTCTGGCTTCAGCTGCATGTCTTTGACCTTGAATGCCCAGCCCCGTGTCTGGGTGCTGCCCCAGACAGCAAGTCC 

ACATCTGAGTGTTGGCCTTCTGGGTTGGTGTCTGCAGCTCTAACTCTACAAAATGTCTTGTGGGTGAATCACGGTT 

TTAACCTTGACTTTTTTTTGTTTGTTTGGTTTTTTTTGAGACGGAGTC'^ 

TGGTGCAACCTCAGCTCACTGCAACCTCCGCCTCCCAGGTTCAAGCAATTCTGTCTCTGCCTCCCGAGTAGCTAGA 

ATTACAGGCACGCACCACCACGCCCAGCTGATTTTTGTATTTTTATTTATTTATTTATTTATTTTOT 

GGGATTTCACGATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCT^^ 

CAAAGTGCTGGGATTACAGGCGTGAGCCACCACACCTGGCCAACCTTGACTATTTATTATAGGTAATTCTGTGCAG 

ATGTCTGACTTATGTTGGCCATCTCCAGGATGGACCTGAACTTTCACACGTATGTCCCTGTGACTAAATCCAGGTG 

TCATTTGCAT^AAAACAACTAATATTATTAAGTAGCTACCAGGGCTAGGTATCACTCACCATACATACACACATGCA 

CACACACACATACACATTCCTACCTCATCCTTACAACAATCTTCATTTTACAGATGAGGAAACAGAGGCACAGACA 

GGTCGAATAACTTACTCAAAGTTTCACAGCTAGTACATTCGAACCCAGGCTTAAGGACCCATCTTTGTCCAGACCC 

TGTATGCAAGTGTCTGTGACACTGGATGCCAAGACTCACACTAGAGATGTTGAATTTAGGTCTGAACAATATCCAA 

TTCTGTGTGTGTGTTTGTGTGTGCATGTGTGTGTGTGTATGTATTCATGTCTTAACCATCCATAOT 

GCTGGAGTGCAATGGAGCAACCTCCGCTCACTGCAATCTCCGCCTCCCGGGTTCAAGCGATTTTCCTGCCTCAGCC 

TCCAGAGTAGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTTTGTATTTTTGTTAGAGACAGGGTTT 

CCCCATATTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGA 

TTACAGGCATGAGCCACCGTGCCCAGCCTGTGCTGTGATTCTTGAAGCTGCAACCCATGTGCATGCAAGT^ 

CAGCTTCCAGTCCTGTCCATAGCTGTACCTAAGTGTGGAAGCTGGATGTGCATGTATGCATGTCCATGACCTTGTA 

TAGCCACATCTGGGACTCATACTGCACACTGAATTTGGCTGACATGTCCAGACTCTGGGGCCAAGGCTGGGTCACA 

CATACTGAGTGGCCACATGCGTTTGACGTCTGTGACAATTTGGTGACCGTGAATGACTGGTTTCAAGTGACCACCT 

GTCTGAACCTGTATCCAGTGCCCCTGTCTCCACCCCCAACCACAGAGGACTTCTTGCCCTCTGGTCTGTTCCCCTT 

CCTCTCTCTCCCAGAGTCTTATAGCAAATGGGGTGGGGGCTAGAGTTCTGGAGAAAACAGGCAGCGGTTGTAAATA 

AACAACAGGGCAGGCGGAGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGAGCAT 

TTGAAGTCAGAAGTTTGAGACTACCTGGCTAACATGGTGAGACCTCGTCTCTACTAAAAATACAAAAATTAGCCAG 

GTGTGGTGGCGGGCACCTCAGCTACTCGGGAGGCTGAGGCAGGAGGATCACTTGAACCCAG6AGGCGGAAGTTGCA 

GTGAGCTGAGATCATGCCACTGCACTCCAGCCTGGGCAAAAGAGTGAGACTCCGTCTCAAAAACAACAACAACAAC 

AAAACAAAAAACAGGGCAGGGTGTCTTGAGAAGTTAGGGGAAAGGCATAGGCATATAGTAGTTAGGGCAGGGTGCA 

AGGAASGTGTAGGAGGCAATGTAAACGTCCCTGTCCTCAGGCATCCTCTACCCCTTCTCTTAG 

SEQ ID NO 14 

Bxon 2 

5259-5412 

CAGCCCAGGATGGTGACAAGTTGCTGGAAGGTGACGAGTGTGCACCCCACTCCCAGCCATGGCAAGTGGCTCTCTA 
CGAGCGTGGACGCTTTAACTGTGGCGCTTCCCTCATCTCCCCACACTG6GTGCTGTCTGCGGCCCACTGCCAAAGC 
CG 

SEQ ID NO 15 
Intron 2 



5413-5912 
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GTATGAAGGCAGGGGCTCAGGGTCCTGAGGGAGCCTGGTTCGGGGGGAAGAGCTCCTAGATTT^ 

GGCAGACGCCAGAACTCCTGGGTTCTGAAAGACGAGGAGGCCGGATGTCAAGCCCCTGGGTTAGGAAGGAGTGTGT 

GTTTCAAAGCCTTCGATCTCTGAAGGAGGAAGGAGAAGACTAGTTCCAGCTTTTGAGCCTCAGTTCTAGGGATGTG 

AGAATCCTGGATTCGGGGACAGACCAGGAGGGGGCTGGGAGTAGTTGGAGGGGATCGAGTTCTAGGAGTGTGCCTG 

ACTTCAGACTCGTTGGTCCTTGAGGAGCAGGGGCTGGAACCATTGGCTTCAGGGTCTTGGGAAAAGGTA^ 

GTCGAGATTTCTAAAGGGTCGGGAGACCTCGGGTTGCCCACTCTTTGATCTTTCTGTCCTC 

ACTGGCCCGCACTCCACTGGCGGGAAAACCACTCGCCCGCACAG 



SBQ ID NO 16 

Exon 3 (Classic and Splice Variant 2) 



5913-6196 



cttcatga gagtgcgcct gggagagcac aacctgcgca agcgcgatgg cccagagcaa ctacggacca 
cgtctcgggt cattccacac ccgcgctacg eiagcgcgcag ccaccgcaac gacatcatgt tgctgcgcct 
agtccagccc gcacgcctga acccccaggt gcgccccgcg gtgctaccca cgcgttgccc ccacccgggg 
gaggcctgtg tggtgtctgg ctggggcctg gtgtcccaca acgagcctgg gaccgctggg 
agcccccggt cacaag 

SEQ ID NO 17 

Intron 3 - Classic and Spice Varicuit 2 
6197-6316 



tgc gtgaaaggat ggagctggat gcgaggcctc aaggaatccta tgctccaggg ctcttgggcg 
gaggggaca agggccggaa tttatggatc. tgctccaagtccactgtctt ccccag 



SEQ ID NO 18 

Exon 3 - (Splice variant 1 and 3) 
5913-6078 

CTTCATGAGAGTGCGCCTGGGAGAGCACAACCTGCGCAAGCGCGATGGCCCAGAGCAACTACGGACCACGTCTCGG 
GTCATTCCACACCCGCGCTACGAAGCGCGCAGCCACCGCAACGACATCATGTTGCTGCGCCTAGTCCAGCCCGCAC 

CTGGGGCCTGGTGTCCCACAACGAGCCTGGGACCGCTGGGAGCCCCCGGTCACAAG 
SEQ ZD NO 19 

Intron 3 - (Splice variant 1 and 3) 
6079-6316 

GTGCGTGAAAGGATGGAGCTGGATGCGAGGCCTCAAGGAATCCTATGCTCCAGGGCTCTTGGGCGGAGGGGACAAG 
GGCCGGAATTTATGGATCTGCTCCAAGTCCACTGTCTTCCCCAG 

SEQ ID NO 20 

Exon 4 

6317-6453 (Classic and Splice Variant 1) 

TGAGTCTCCCAGATACGTTGCATTGTGCCAACATCAGCATTATCTCGGACACATCTT6TGACAAGAGCTACCCAGG 
GCGCCTGACAAACACCATGGTGTGTGCAGGCGCGGAGGGCAGAGGCGCAGAATCCTGTGAG 

SEQ ID NO 21 

Intron 4 (Classic and Splice Variant 1) 
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6454-7126 

GTCAGAGCCTAGAGGGGCCATCAGGCGGAAGAAGAGGGATGGGGACAGGTGTGGGAGTCCGGATGGGGTTGGATTT 

TCTTTGCTTTGGGCCAGAGAAGATGCTAGGGTTAGGCTTGGAGATGGAGTAGGAAGAGAAGTTAGAATAGGGGTGA 

GGTTGGAGTTGGGGTTATAGGTGGGGATTGCGTTGTTTGAGGTGGATAACTGTGATAGTTAGTTTGAGATGGCATG 

GGTTGGGGTTGAGAATGGGAATGGTTTGGTTTGATTCTGGGTGGGAAATACGTCAGGGTTGAATTGGGATGAGGT^ 

GATTTTGTTTGGAATGCAGAAGACATGAAGATTGAGATTGGATTTTGAGATGGGCATGGGT^ 

ATGGTGAGGATGTGGGCTGAGTTGGATTTAACTTAGTACAGTTGCACTGGAGTTGCATGGG^ 

AGGTTGGGTGAGTTGTATTGAGCTGTGTTGAATTGGGGTTGGGGTTGGGGTTGGGTTGGCTCTGTT^ 

TGGGCTGTATTGAGTTGAGTTGGGTTGGGGTTCCCTGGGATGGGGATGGATTGGGTTTGGGGTGAGATT6CAAATG 

GTGATTAGGATGAGGATGAATCCAGGAGGTTTCACTCAACCTGAGACCCCCTCTTTTCCCCACAG 



SBQ ID NO 22 

Intron 4 (Splice variant 2 and 3) 
6079-7126 

T GCGCCCCGCG GTGCTACCCA CGCGTTGCCC CCACCCGGGG GAGGCCTGTG TGGTGTCTGG 
CTGGGGCCTG GTGTCCCACA ACGAGCCTGG GACCGCTGGG AGCCCCCGGT CACAAGGTGC GTGAAAGGAT 
GGAGCTGGAT GCGAGGCCTC AAGGAATCCTATGCTCCAGG GCTCTTGGGC GGAGGGGACA AGGGCCGGAA 
TTTATGGATC TGCTCCAAGT CCACTGTCTT CCCCAGTGAG TCTCCCAGAT ACGTTGCATT GTGCCAACAT 
CAGCATTATC TCGGACACAT CTTGTGACAA GAGCTACCCA GGGCGCCTGA CAAACACCAT GGTGTGTGCA 
GGCGCGGAGG GCAGAGGCGC AGAATCCTGT GAGGTCAGAG CCTAGAGGGG CCATCAGGCG GAAGAAGAGG 
ATGGGGACA GGTGTGGGAG TCCGGATGGG GTTGGATTTT CTTTGCTTTG GGCCAGAGAA GATGCTAGGG 
TTAGGCTTGG AGATGGAGTA GGAAGAGAAG TTAGAATAGG GGTGAGGTTG GAGTTGGGGT TATAGGTGGG 
GATTGCGTTG TTTGAGGTGG ATAACTGTGA TAGTTAGTTT GAGATGGCAT GGGTTGGGGT TGAGAATGGG 
AATGGTTTGG TTTGATTCTG GGTGGGAAAT ACGTCAGGGT TGAATTGGGA TGAGGTAGAT TTTGTTTGGA 
ATGCAGAAGA CATGAAGATT GAGATTGGAT TTTGAGATGG GCATGGGTTT GATTTGATTT TGAATGGTGA 
GGATGTGGGC TGAGTTGGAT TTAACTTAGT ACAGTTGCAC TGGAGTTGCA TGGGGGTGAG ATTGGATATA 
GGTTGGGTGA GTTGTATTGA GCTGTGTTGA ATTGGGGTTG GGGTTGGGGT TGGGTTGGCT CTGTTTGGGA 
TAAACTGGGC TGTATTGAGT TGA6TTGGGT TGGGGTTCCC TGGGATGGGG ATGGATTGGG TTTGGGGTGA 
GATTGCAAAT GGTGATTAGG ATGAGGATGA ATCCAGGAGG TTTCACTCAA CCTGAGACCC CCTCTTTTCC 
CCACAG 



SEQ ID NO 23 
Exon 5 
7127-7786 

GGTGACTCTGGGGGACCCCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTGTCCTGGGGTGACGTCCCTTGTGACA 
ACACCACCAAGCCTGGTGTCTATACCAAAGTCTGCCACTACTTGGAGTGGATCAGGGAAACCATGAAGAGGAACTO 
ACTATTCTAGCCTATCTCCTGTGCCCCTGACTGAGCAGAAGCCCCCACAGCTGGCCAGCAGCCCCGCCTGACATGG 
AACAGAACGGAGCCATCCCCCAAGACCCTGTCCAAGGCCCAGATGTTAGCCAAGGACTTGTCCCACCTGAGGACAA 
AGCTGGCGCTCAAGGTCACCTGTTTAATGCCAAGATAACAAAGCGCTGATCCAAGTTGCTCTGTAGGAATTTCTGT 
GACTTTTTTCTGGGGTCAAAGAGAAACCCCGAGACACTGTACACTGTTCCTTTTCACCCACCACCCCGATCCCTAG 
GTGAGGAGAAGCGGCTTGAAGCAGGGCTCCATTCATTCAACACACATGACCACCCGTGTGATCTTGAACAAGAGGC 
CCAATCTCACTTCGCCTTGGTTTCCTTATCTGTAAAATGAGACCATCTTATTGCTGACTTCAAAGGGCTGTTGTGA 
GGATTAAATGAGATGATTCGTCTGAACTGATTAAAATCGTGTCTG6CACTGA 



SEQ ID NO 24 
7127-7279 

GGTGACTCTGGGGGACCCCTGGTCTGTGGGGGCATCCTGCAGGGCATTGTGTCCTGGGGTGACGTCCCTTGTGACA 
ACACCACCAAGCCTGGTGTCTATACCAAAGTCTGCCACTACTTGGAGTGGATCAGGGAAACCATGAAGAGGAACTO 
A 



SEQ ID NO. 25 
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Zyme 

MKKLMVVLSLIAAAWAEEQim>VHGGPCDKTSHPYQAALYTSGHLL^ 

LRQRE S SQEQS SVVRAVIHPDYDAASHDQDIMLLRr*ARPAKLSELIQPLPLERIX:S 

DTIQCAYIHLVSREECEHAYPGQITQMMLCAGDEKYGKDSCQGDSGGPLVCGDHLRGLVSWGNIPCGSKEKPGVYT 
NVCRYTNWIQKTIQAK 



SEQ ID NO. 26 
KLK-L4 

MWPLALVIASLTLALSGGVSQESSKVLNTWGTSGFLPGGYTCFPHSQPWQAAL^^^ 
CLKEGLKVYLGKHAIXSRVEAGEQVREVVHSIPHPEYIUlSPTHLlSfHDHDIMLLELQSPVQL^^^ 
PGTTCRVSGWGTTTSPQVNYPKTLQCi^IQLRSDEECRQVYPGKITDNMLC^^ 
YGrVSWGDFPCGQPDRPGVYTRVSRyVLWIRETIRKYETCKXJKWL^^ 

SEQ ID NO. 27 

KLK-L6 

MFLLLTALQVLAIAMTQSQEDENKIIGGHTCTRSSQPWQAALLA 

GPRRRFLCGGALIiSGQWITAAHCGRPILQVALGKHNLRRWEATQQVLRVVRQVTHPNYNSRTHDl^ 

ARIGRAVRPIEVTQACASPGTSCRVSGWGTISSPIARYPASLQCVNINISPDEVCQKAYPRTITPGMVCA6VPQGG 

KDSCQGDSGGPLVCRGQLQGLVSWGMERCALPGYPGVYTNLCKYRSWIEETMRDK 

SEQ ID NO. 28 

TLSP 

MQRLRWLRDWKSSGRGLTAAKEPGARSSPLQAMRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTR 
LLCGATLIAPRWLLTAAHCLKPRYrVHIiGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDI^ 
SITWAVRPLTLSSRCVTAGTSCLISGWGSTSSPQI*RLPHTLRCANITIIEHQKCENAYPGNITDTMVC^ 
DSCQGDSGGPLVCNQSLQGIISWGQDPCAITRKPGVYTKVCKYVDWIQE 



SEQ ID NO. 29 
KLK-L3 

MKLGLLCALLSLrAGHGWADTRAIGAEECRPNSQPWQAGLFHLTRLFCGATLISDRWLLTAAHCRK^ 
HLWKWEGPEQLFRWDFFPHPGFNKDLSANDHNDDIMLIRLPRQTUILSPAVQPLNLSQTOTSPGMQCLISG^ 
SPKALFPVTLQCANISILENKLCHWAYPGHISDSMLCAGLWBGGRGSCQGDSGGPLVCNGTLAGW 
RRPAVYTSVCHYLDWIQEIMEN 



SEQ ID NO. 30 



NESl 



MRAPHLHLSAASGARALAKLLPLLMAQLWAAEAALLPQNDTRLDPEAYGAPCARG SQPWQVSLFNGLSFH 
CAGVLVDQSWVLTAAHCGNKPLWARVGDDH LL-LLQG-EQLRRTT RSWHPKYHQGSGPI LPRRTDEHDLML 
LKLARPW-PGPRVR ALQLPYR-CAQPGDQ CQVAGWGTTAARRVK YNKGLTCSSITILSP 
KECEVFYPGWTNNM ICAGLDR-GQDPCQS DSGGPLVCDETLQGI LSWG- 
VYPCGSAQHPAVYTQICKYMSWINK VIRSN 



SEQ ID NO. 31 
KLK-L5 



MGLSIFLLLCVLGLSQAATPKIFNGTECGRNSQPWQVGLFEGTSLRCGGVLIDHRWVLTAAHCSGSRYWVRLGEHS 
LSQIiDWTEQIRHSGFSWHPGYLGASTSHEHDLRLLRLRLPVROTSSVQPLPLPNDCATAGTECHVSGWGITNHPR 
NPFPDLLQCLNLSIVSHATCHGVYPGRITSNMVCAGGVPGQDACQGDSGGPLVCGGVLQGIjVSWGSVGPCGQDGIP 
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GVYTyiCKYVDWIRMIMRNN 



SEQ ID NO. 32 
Neuropsin 

-MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVLVGGNV^ 
PKYTVRIXSDHSLQNKDGPEQEIPWQSIPHPCYNSSDVEDHOTIDLMLLQLiyDQASLGSK^ 
TOSGWGTVTSPRENFPDTLNCAEVKIFPQKKCEDAYPGQITDGWCAGSSKGAiyrcQ^ 
GSDPCGRSDKPGVYTNICRYLDWIKKI IGSKG 

SEQ ID NO. 33 

PSA 

MWVPVVFLTLSVTWIGAAPLILSRIVGGVraCEKHSQPWQVLVASRGRAVCGGVLVHPQWVLT 
RHSLFHPEDTGQWQVSHSFPHPLYDMSLLKNRFLRPGDDSSHDLMLLRLSEPAELTDAVKVm 
YASGWGSIEPEEFLTPKKLQCVDLHVISNDVCAQWPQKVTKFMLCAGRWTGGKSTCSGDSGGPLVCNGTV^ 
WGSEPCALPBRPSIiyTKWHYRKWIKDTIVANP 



SEQ ID NO. 34 
HK2 

imJLVLSIALSVGCTGAVPLIQSRIVGGWECEKHSQPWQVAVYSHGWAHCGGVLVHPQWVLTAAHCLKK^ 
RHNLFEPEDTGQRVPVSHSFPHPLYNMSLLKHQSLRPDEDSSHDimLRLSEPAKITDVVKVLGLPTQEPA^^ 
YASGWGSIEPEEFLRPRSLQCVSLHLLSNDMCARAYSEKVTEFMLCAGLWTGGKDOXrGGDSGGPLVCNGVLQGITS 
WGPBPCALPEKPAVYTKVVHyRKWIKDTIAAKP 



SEQ ID NO. 35 

HKl 

MWFLVLCLALSLGGTGAAPPIQSRWGGWECEQHSQPWQAALYHFSTFQCGGILVHRQWVLTAAHCISDNYQLWLG 
RHNIiFDDENTAQFVHVSESFPHPGFNMSI^EmTRQADBDYSHDIjMLLRLTEPADTITDAVKWELP 
CLASGWGSIEPENFSFPDDLQCVDLKILPNDECKKAHVQK\ra'DFMLCVGHLEGGKDTCVGDSGGPI^ 
SWGYVPCGTPNKPSVAVRVLSYVKWIEDTIAENS 

SEQ ID NO. 36 



KIiK-L2 

MATARPPWMimiCALITALLLGVTEHVIiANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDDSSSR^ 
TQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLSPVYESGQQMFQGVKSIPHPGYSHPGHSN 
DLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCLVSGWGTTKSPQVHFPKVLQCLNISVLS 
MFCAGDKAGRDSCQGDSGGPWCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 



SEQ ID NO. 37 
prostase 



MATAGNPWGWFLGYLILGVAGSLVSGSCSQIINGEDCSPHSQPWQAALVMENBLFCSGVLVHPQWVLSAAHCFQNS 
YTIGLGLHSLEADQEPGSQMVEASLSVRHPEYNRPLLANDLMLIKLDESVSSDTIRSISIASQCPTAGNSCLVSGW 
GLLANGRMPTVLQCVNVSWSEEVCSKLYDPLYHPSMFCAGGGHDQKDSCNGDSGGPLICNGYLQGLVSFGKAPCG 
QVGVPGVYTNLCKFTEWIBKTVQAS 



SEQ ID NO. 38 
HSCCH 
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MARSLLLPLQILLLSLALETAGEEAQGDKIIDGAPCARGSHPWQVALLSGNQLHCHSCCEGGVLVlSrERWVLTAAHC 
KMNEYTVHLGSDTLGDRRAQRIKASKSFRHPGYSTQTHVNDLMLVKI^ 
GWGTTTSPDVTFPDmCVDVKLISPQDCTKVYKDLLENSMLCAGIPDSKKNACMGDS 
WGTFPCGQPNDPGTOPQVCKFTRWINDTMKKHR 

SEQ ID NO 39 

CACAAC6AGCCTGGGACCGCTGGG 

SEQ ID NO 40 
ATTAAA 

SEQ ID NO 41 
Table 1 KLKl-A 
ATCCCTCCATTCCCATCTTT 

SEQ ID NO 42 
Table 1 KLKl-B 
CACATACAATTCTCTGGTTC 

SEQ ID NO 43 

Table 1 KLK2-A 

AGTGACACTGTCTCAGAATT 

SEQ ID NO 44 

Table 1 KLK2-B 

CCCCAATCTCACGAGTGCAC 

SEQ ID NO 45 
Table 1 E5-A 
GTCGGCTCTGGAGACATTTC 

SEQ ID NO 46 
Table 1 E5-B 
AACTGGGGAGGCTTGAGTC 

SEQ ID NO 47 

Table 1 KLK15-F1 

CTCCTTCCTGCTGGCATCCA 



SEQ ID NO 48 
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Table 1 KLK15-R1 
ATCACACGGGTGGTCATGTG 
SEQ ID NO 49 
Table 1 KLK15-F2 
CAAGTGGCTCTCTACGAGCG 
SEQ ID NO 50 
Table 1 KLK15-R2 
GACACCAGGCTTGGTGGTGT 
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